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ABSTRACT 


Title  of  Dissertation:  MULTIFRACTAL  INTERNET  TRAFFIC  MODEL 

AND  ACTIVE  QUEUE  MANAGEMENT 


Jia-Shiang  Jou,  Doctor  of  Philosophy,  2003 


Dissertation  directed  by:  Professor  John  S.  Baras 

Department  of  Electrical  and  Computer  Engineering 


We  propose  a  multilevel  (hierarchical)  ON / OFF  model  to  simultaneously  cap¬ 
ture  the  mono/multifractal  behavior  of  Internet  traffic.  Parameter  estimation 
methods  are  developed  and  applied  to  estimate  the  model  parameters  from  real 
traces.  Wavelet  analysis  and  simulation  results  show  that  the  synthetic  traffic 
(using  this  new  model  with  estimated  parameters)  and  real  traffic  share  the  same 
statistical  properties  and  queuing  behaviors.  Based  on  this  model  and  its  statis¬ 
tical  properties,  as  described  by  the  Logscale  diagram  of  traces,  we  propose  an 
efficient  method  to  predict  the  queuing  behavior  of  FIFO  and  RED  queues.  In 
order  to  satisfy  a  given  delay  and  jitter  requirement  for  real  time  connections,  and 
to  provide  high  goodput  and  low  packet  loss  for  non-real  time  connections,  we  also 
propose  a  parallel  virtual  queue  control  structure  to  offer  differential  quality  of 
services.  This  new  queue  control  structure  is  modeled  and  analyzed  as  a  regular 


nonlinear  dynamic  system.  The  conditions  for  system  stability  and  optimization 
are  found  (under  certain  simplifying  assumptions)  and  discussed.  The  theoretical 
stationary  distribution  of  queue  length  is  validated  by  simulation. 
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Chapter  1 


Introduction 


With  the  rapid  development  of  communication  and  networking  technologies  in 
the  last  three  decades,  the  Internet  has  become  the  largest  artificial  system  in  the 
world.  It  is  one  of  the  most  important  and  quickest  media  for  delivering  information 
nowadays.  One  may  receive  and  send  various  kinds  of  information  such  as  data, 
voice  and  video,  in  the  form  of  email,  web  page,  Internet  phone/conference  and 
online  radio/TV.  The  life  style  of  human  beings  has  been  greatly  changed  by 
this  diversity  of  information  delivered  and  associated  information  processing  and 
utilization.  However,  the  current  usage  of  the  Internet  is  far  beyond  its  original 
design  envelope  and  causes  many  operational  and  performance  problems.  In  this 
dissertation,  we  focus  on  traffic  modeling  and  active  queue  management  policies 
to  improve  the  performance  of  the  Internet. 

1.1  Motivation 

Recent  studies  [1]  [2]  [3]  [4]  [5]  [6]  [7]  [8]  on  Internet  traffic  have  shown  that  the  ag¬ 
gregate  traffic  driven  by  TCP  based  protocols  such  as  HTTP  is  not  only  monofrac- 
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tal  (self-similar)  but  also  multifractal.  Wavelet  analysis  demonstrates  that  the  In¬ 
ternet  traffic  is  monofractal  at  large  time  scales  (5-10  minutes  and  larger),  which 
is  mainly  due  to  the  heavy-tailed  distribution  of  hie  sizes  transferred  over  the  In¬ 
ternet  [9]  [10]  [11]  [12].  However,  traffic  behavior  at  small  time  scales  is  much 
more  complicated  and  has  been  shown  to  be  multifractal  [2]  [13]  [14]  [15].  This 
multifractal  behavior  is  primarily  due  to  protocol  dynamics  such  as  TCP  how 
control,  network  congestion,  packet  loss  and  packet  retransmission.  Taqqu  and 
Willinger  [16]  explained  the  monofractal  behavior  at  large  time  scales  by  aggre¬ 
gating  a  large  number  of  independent  ON/OFF  hows.  The  ON  and/or  OFF  du¬ 
ration  in  their  model  has  a  heavy-tailed  distribution,  which  corresponds  to  the 
total  transmission  time  of  a  single  hie  and  the  user  think  time  respectively.  They 
proved  that  the  aggregate  traffic  converges  to  the  well-known  fractional  Brownian 
motion  [17]  [18]  asymptotically  when  the  number  of  hows  goes  to  infinity.  They 
also  found  a  simple  relationship  between  the  shape  parameter  of  the  heavy-tailed 
distribution  and  the  Hurst  parameter  of  self-similarity.  However,  the  single  level 
ON/OFF  model  [19]  [20]  [21]  [11]  [22]  [23]  [24]  is  unable  to  explain  the  multifractal 
behavior  found  at  small  time  scales  with  its  constant  rate  assumption  in  the  ON 
duration.  On  the  other  hand,  Ricdi  proposed  the  Multifractal  Wavelet  Model  to 
capture  the  second  order  statistical  behavior  at  all  time  scales  [25]  [26].  Like  other 
cascade  models  [27]  [28]  [14],  these  cascade  models  are  unable  to  explain  the  ob¬ 
served  traffic  behavior  by  simple  operational  network  mechanisms.  Furthermore, 
these  cascade  models  need  typically  many  parameters  to  ht  the  statistical  behavior 
of  real  Internet  traffic.  Therefore,  it  is  important  to  provide  a  more  precise  model 
that  can  capture  traffic  behavior  at  all  time  scales  and  better  explain  the  relation¬ 
ship  between  observed  traffic  statistical  properties  and  natural  and  simple  network 
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operational  mechanisms.  One  of  the  goals  of  this  dissertation  is  to  provide  such  a 
model  for  the  Internet  traffic. 

In  addition,  we  are  interested  in  network  control  and  performance  enhancement. 
Most  analyses  of  queuing  behaviors  [29]  [30]  [4]  [31]  has  been  concerned  with  infinite 
or  large  buffer  size.  The  effect  of  short  range  dependence  (SRD)  on  the  traffic  is 
absorbed  by  such  a  large  buffer  size.  As  a  result,  the  analysis  result  of  queuing 
behavior  is  dominated  by  long  range  dependence  (LRD)  phenomena  [32]  [33]  [30] 
[17]  [34]  [35]  [36].  However,  the  actual  buffer  size  in  a  real  router/switch  is  finite 
and  small.  In  this  situation,  the  queue  can  not  retain  the  “memory”  for  a  long  time. 
On  the  other  hand,  short  range  dependence  in  the  traffic  also  plays  an  important 
role  in  queuing  behavior.  Wavelet  analysis  provides  a  convenient  way  to  extract 
important  statistical  properties  from  the  traffic.  For  example,  the  Logscale  diagram 
of  the  wavelet  analysis  provides  the  second  order  statistical  properties  of  the  traffic 
at  all  time  scales  (SRD  and  LRD).  This  background  motivated  us  to  propose  a  new 
approach  to  describe  traffic  and  predict  the  corresponding  connection  performance 
such  as  the  delay  and  delay  jitter. 

The  buffer  management  policy  at  the  bottleneck  routers  determines  the  connec¬ 
tion  quality  directly.  The  drawbacks  of  the  drop-tail  policy,  such  as  long  queuing 
delay  and  low  link  utilization,  for  Internet  traffic  were  well-known  [37]  [38]  [39]  [40]. 
In  order  to  improve  network  performance,  many  buffer  management  policies  such  as 
RED  (Random  Early  Detection)  [41]  [42]  [43]  [44],  BLUE  [40],  FQ/WFQ  (Weighted 
Fair  Queue)  [45],  CHOKe  [46]  have  been  proposed  and  evaluated.  Among  them, 
RED  attracted  the  most  attention  in  the  held.  According  to  the  TCP  protocol,  the 
TCP  connection  decreases  its  flow  rate  dynamically  if  packets  are  lost.  The  basic 
idea  of  RED  is  to  maintain  the  queue  length  within  a  given  region  by  randomly 
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dropping  packets  among  the  various  connections  before  the  buffer  is  overflowed. 
The  dropping  probability  is  an  increasing  function  of  queue  length.  A  connection 
with  a  higher  rate  has  a  higher  risk  to  lose  packets  and  reduce  its  flow  rate.  Since 
the  queue  length  is  controlled  and  kept  within  a  desired  region,  link  bandwidth  is 
fully  utilized  and  the  packets  experience  smaller  mean  delay  and  delay  variation. 

However,  a  great  portion  of  Internet  traffic  belongs  to  web  traffic  (HTTP)  and 
voice/video  streams  (UDP).  Most  of  web  pages  contain  several  small  hies  and  create 
a  short  but  bursty  bandwidth  requirement.  As  a  consequence,  the  TCP  protocol 
of  web  connections  is  primarily  operated  in  the  slow  start  phase.  Dropping  packets 
during  this  phase  cannot  effectively  control  the  congestion  level  at  the  router,  but 
greatly  increases  the  mean  and  variance  of  hie  delivery  delay.  Since  the  connection 
quality  of  real  time  applications,  such  as  web  and  voice/video  traffic  is  sensitive 
to  the  mean  and  variance  of  delay,  these  considerations  motivated  us  to  propose  a 
parallel  virtual  queue  control  structure  to  serve  real  time  and  non-real  time  traffic 
separately,  and  adaptively. 

1.2  Contributions 

The  contributions  of  this  dissertation  are  in  three  areas.  First,  we  developed  a 
new  multilevel  hierarchical  model  of  Internet  traffic  traces,  using  simple  oper¬ 
ational  patterns  of  packets  and  sessions.  As  such  the  principal  model  and  its 
derivative  models  can  be  used  in  a  variety  of  situations  (open-loop  (i.e.,  without 
how  control),  closed-loop  (i.e.,  with  how  control)),  with  different  how  and  conges¬ 
tion  control  schemes  (i.e.,  not  just  TCP),  and  in  modeling  traffic  at  different  layers 
(network  or  application  layer).  Second,  we  developed  queue  behavior  estimates 
and  predictions  of  performance  for  FIFO  queues  fed  by  such  traffic  traces.  We  also 
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developed  estimates  of  delay  and  delay  jitter.  With  these  prediction  results,  one 
can  allocate  network  resources  dynamically  to  guarantee  differential  quality  of  ser¬ 
vice  requirements.  For  example,  in  order  to  provide  differential  quality  of  service 
in  the  DirecPC  system,  the  bottleneck  router  (Satellite  Gateway)  in  the  network 
operation  center  has  high  and  low  priority  queues  for  different  connections.  Our 
prediction  method  can  help  the  network  manager  determine  how  much  bandwidth 
should  be  allocated  to  each  queue  so  that  their  quality  of  service  requirements  can 
be  satisfied  respectively.  Third,  we  developed  a  new  parallel  queue  control  scheme 
that  treats  bursty  and  non-bursty  (real-time)  traffic  differently  for  higher  perfor¬ 
mance  of  bottleneck  nodes.  We  also  investigated  analytically  the  stability  of  the 
system. 

The  first  and  second  areas  and  contributions  are  more  connected  and  interre¬ 
lated  than  the  other.  The  second  and  third  areas  address  queue  control  problems 
from  different  perspectives.  The  connection  between  the  first  and  third  areas  is 
that  the  qualitative  characterization  of  traffic  in  the  first  provided  the  inspiration 
for  the  scheme  in  the  third. 

The  multifractal  behavior  of  Internet  traffic  at  small  time  scale  is  mainly  con¬ 
tributed  by  the  TCP  flow  control  mechanism.  The  corresponding  connection  pa¬ 
rameters  such  as  round-trip  time,  TCP  session  lifetime  and  active  time  of  burst 
play  an  important  role  of  traffic  behavior.  Based  on  these  network  parameters 
estimated  from  real  traces,  we  propose  a  key  traffic  model  with  a  multilevel  hier¬ 
archical  ON / OFF  structure  for  Internet  Traffic.  The  key  idea  is  to  simultaneously 
emulate  the  packet  arrival  pattern  (operational  pattern  and  characterization)  in 
a  typical  Internet  session  at  small  time  scales  and  the  user  behavior  (operational 
characterization)  at  large  time  scales.  We  first  estimate  model  parameters  such  as 
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round-trip  time  and  active  time  of  each  connection  from  real  Internet  traces.  The 
synthetic  traffic  trace  is  generated  by  the  proposed  model  with  the  estimated  pa¬ 
rameters.  The  statistical  behavior  of  both  synthetic  and  real  traffic  are  compared 
by  employing  wavelet  analysis  techniques.  We  demonstrate  that  this  new  model 
precisely  captures  the  statistical  behavior  of  real  Internet  traffic  at  all  time  scales. 
In  addition,  many  network  simulations  demonstrated  that  both  real  and  synthetic 
traffic  traces  also  display  a  similar  queuing  behavior  when  the  First  In  First  Out 
(FIFO)  queue  control  policy  is  applied.  The  steady  state  distributions  of  qnene 
length  in  these  FIFO  qnenes  by  synthetic  and  real  traces  match  as  well  within  a 
wide  range  of  utilization  conditions. 


Figure  1.1:  Methodology 

Since  the  Logscale  diagram  of  wavelet  analysis  of  traffic  traces  extracts  the 
second  order  statistics  of  traffic  traces  at  all  time  scales,  we  provide  a  numerical 
approach  to  obtain  the  Logscale  diagram  from  the  model  parameters,  instead  of 
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analyzing  a  long  duration  trace.  Furthermore,  we  develop  a  fast  algorithm  to 
predict  the  steady  state  queue  length  distribution  of  such  a  FIFO  queue  using 
parameters  computed  from  the  trace  Logscale  diagram.  An  upper  bound  of  buffer 
overflow  probability  is  also  derived  when  the  buffer  size  is  finite  and  given.  Figure 
1.1  depicts  how  the  trace  analysis,  traffic  models,  connection  parameters,  queue 
length  distribution  are  interrelated  and  used.  These  results  are  motivated  from 
and  apply  to  network  gateways  between  heterogeneous  network  domains. 

To  address  the  problems  of  buffer  management  policy,  we  first  demonstrate 
that  the  performance  of  RED  is  severely  degraded  by  bursty  web  traffic.  The 
dropping  probability  of  RED  is  very  sensitive  to  instantaneous  bursts  and  causes 
the  web  connections  to  see  a  high  instantaneous  packet  loss  rate.  This  high  packet 
loss  rate  forces  the  web  TCP  connections  to  enter  the  congestion  avoidance  phase 
prematurely,  thus  leading  to  a  small  TCP  window  at  the  sender  and  low  goodput. 
These  effects  result  in  a  long  delivery  delay  for  small  hie  transmission.  In  order  to 
prevent  the  bursty  short-life  connections  (mostly  operated  in  the  slow  start  phase) 
from  disturbing  the  RED,  we  propose  a  parallel  queue  control  structure  and  apply 
the  RED  and  drop-tail  buffer  management  policies  respectively  to  the  two  queues. 
The  proposed  policy  is  validated  and  evaluated  by  simulation  experiments.  This 
structure  preserves  the  advantages  of  both  RED  and  drop-tail  policies  such  as  high 
link  utilization,  low  loss  rate,  small  packet  delay  and  low  delay  jitter,  in  our  new 
str  uct  ure/schehe . 

Since  the  real  time  and  non-real  time  traffic  are  served  at  different  queues,  we 
have  more  freedom  to  control  the  connection  rate.  For  the  non-real  time  traffic, 
connection  goodput  and  low  packet  loss  rate  are  more  important  than  packet  delay. 
Instead  of  changing  the  dropping  probability  to  control  the  how  rate,  we  propose  a 
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dynamic  threshold  algorithm  to  control  the  flow  rate  by  queuing  delay  so  that  the 
packet  loss  rate  can  be  limited  and  kept  within  a  given  region.  The  main  advantage 
of  our  approach  is  that  it  keeps  the  average  packet  loss  rate  within  a  range  of  very 
small  values  so  that  the  average  TCP  window  size  at  the  senders  has  a  large  value 
in  its  congestion  avoidance  phase.  Hence,  any  congestion  at  the  router  can  not 
cause  another  bottleneck  at  the  TCP  sender  end. 

Finally,  we  model  and  analyze  this  queue  control  system  as  a  regular  nonlinear 
dynamical  system.  The  conditions  for  system  stability  are  found  and  stability 
results  are  proved.  We  also  develop  a  numerical  approach  to  obtain  the  queue 
length  distribution  for  the  dynamic  thresholds  case.  We  also  provide  a  linear 
approximation  approach  with  a  small  perturbation  assumption.  Both  theoretical 
results  are  validated  by  simulations. 

1.3  Organization 

The  arrangement  of  this  dissertation  is  as  follows.  In  Chapter  2,  we  briefly  intro¬ 
duce  the  wavelet  analysis  method  for  Internet  traffic  and  show  the  mono/  multi- 
fractal  behaviors  of  real  traces.  In  Chapter  3,  we  describe  the  proposed  Internet 
traffic  model  involving  network  parameters.  We  also  develop  parameter  estima¬ 
tion  methods  to  obtain  these  model  parameters  from  real  traces.  In  Chapter  4,  a 
synthetic  traffic  trace  is  generated  with  the  proposed  model  and  estimated  model 
parameters.  We  compare  the  statistical  properties  of  these  synthetic  traces  with 
the  corresponding  ones  from  real  traces  by  wavelet  analysis  (Logscale  diagram  and 
multifractal  spectrum).  Simulation  experiments  are  used  to  demonstrate  the  simi¬ 
larity  of  queuing  behaviors  in  FIFO  queues  fed  by  synthetic  and  real  traffic  traces. 
In  Chapter  5,  we  demonstrate  the  vulnerability  of  Adaptive  RED  to  bursty  web 


traffic  and  propose  a  parallel  virtual  queue  control  structure  for  buffer  management 


in  a  router.  The  system  model  and  performance  is  given  in  Chapter  6 
concludes  this  dissertation  and  describes  directions  for  future  work. 


.  Chapter  7 
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Chapter  2 


Preliminaries 

In  this  chapter  we  provide  a  brief  introduction  to  wavelet  analysis  of  traffic  traces, 
self- similarity  and  multifractal  processes. 

2.1  Wavelet  Analysis 

Wavelet  analysis  is  a  multiresolution  analysis  (MRA)  method  and  tool,  which  has 
been  widely  used  in  signal  processing  and  data  analysis  [47]  [48].  It  has  remarkable 
advantages  in  analyzing  stochastic  processes  with  long  range  dependence  [49]  [50] 
[51]  [52]  [53]  [54]  [55].  For  instance,  wavelet  analysis  can  eliminate  the  effect  of 
deterministic  trends  hidden  in  random  processes  if  the  wavelet  function  is  chosen 
properly.  We  now  introduce  the  multiresolution  analysis  in  L2(M). 

Definition  2.1.1  Multiresolution  Analysis  [56]  [57]  A  multiresolution  analy¬ 
sis  consists  of  a  sequence  of  closed  subspaces  Vj,  j  G  Z,  of  L2(M)  satisfying 

Vj  C  Vj- 1  Vj  G  Z  (2.1) 

feVj~  f  (2*(j)  G  Vj-i  Vj  G  Z  (2.2) 
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/  e  Vo  ->  /(•-*))  €  Vo  »teZ  (2.3) 

n^= (°}i  <2  4> 

j£Z 

U^-  =  L2(R);  (2.5) 

jez 

There  exists  a  function  €  V0,  such  that 
{4>{-  —  k)  :  k  G  Z}  is  an  orthonormal  basis  for  V0.  (2.6) 


The  function  cf  in  (2.6)  is  called  the  scaling  function  of  the  given  MRA. 


Figure  2.1:  Subspaces  of  multiresolution  analysis 

Remark  2.1.2  Riesz  basis  Eq.(2.6)  implies  that  —  k)  :  k  e  Z}  is  a  Riesz 
basis  for  Vq.  That  is,  for  every  f  G  Vo  there  exist  a  unique  sequence  {atk}k&z  such 
that 

f(t)  =  ^ak(p(t-k)  (2.7) 

kez 

with  convergence  of  the  series  understood  in  L2(M)  sense. 
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Remark  2.1.3  Let  4>j,k  =  2”-7//20(2 ~H  —  k).  According  to  (2.2)  and  (2.6),  {4>j,k  ■ 
k  G  Z}  is  an  orthoriormal  basis  for  Vj . 

Now,  we  introduce  the  construction  of  orthonormal  wavelets  from  MRA.  Let 
IL'o  be  the  orthogonal  complement  of  Vo  in  VI  y  that  is,  V_i  =  Vo  ©Wo-  According 
to  Definition  2.1.1,  we  have 

Vj^  =  Vj@Wj,  Vie  Z.  (2.8) 

Since  V3  — >  {0}  as  j  — >  oo, 

OO 

=  =  0114,  Vi  e  Z.  (2.9) 

n=j 

Since  V,-  — >  L2( Z)  as  j  — >  —  cxd, 

OO 

L2(M)  =  0  Wn.  (2.10) 

n=— oo 

To  find  an  orthonormal  wavelet  with  a  given  scaling  function  <f>,  all  we  have  to 
do  is  to  find  a  function  4  €  Wo  such  that  {4(-  —  k)  :  k  €  Z}  is  an  orthonormal 
basis  for  Wo-  Daubechies  [57]  had  shown  the  existence  of  orthonormal  wavelets 
with  compact  support  (i.e.  finite  duration  waveforms)  in  the  following  theorem: 

Theorem  2.1.4  [56]  [57]  For  any  integer  n  =  0, 1,2, ...  there  exists  an  orthonor¬ 
mal  wavelet  4  with  compact  support  such  that  i]  has  bounded  derivatives  up  to 
order  n.  Moreover,  i]  can  be  obtained  from  an  MRA  whose  scaling  function  <f  also 
has  compact  support  and  the  same  smoothness  as 

Many  wavelet  families  such  as  Haar,  Daubechies,  Meyer,  Morlet,  Mexican-hat, 
Battle-Lemarie,  Shannon,  etc.  [56]  [58]  [59]  [57]  have  been  eveloped  for  different 
purposes.  We  apply  these  useful  results  to  analyze  the  statistical  properties  of 
Internet  traffic  traces. 
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We  next  give  a  simple  example  to  explain  the  procedure.  Assume  our  target 
function  X(t)  is  defined  in  the  interval  [0,1)-  Let  the  scaling  function  be  0(f)  := 
1{0  <  t  <  1},  which  forms  a  basis  for  the  subspace  V0.  Since  the  subspace  V3 
are  all  nested,  i.e.  V3  C  V)_i,  and  formed  by  orthonormal  bases  in  Remark 

2.1.3,  the  basis  at  level  j  —  0  should  be  expressed  in  terms  of  the  basis  at  the  finer 
next  level  j  =  —1.  According  to  (2.2)  we  have 

0(f)  =  0(2 f)  +  0(2t  -  1) 

=  2-1/20_ljO(t)  +  2_1//20_11(f).  (2.11) 


Let 


X{t)(j)jik{t)dt 


(2.12) 


be  the  coefficient  of  0j,fc(f)  for  all  j,  k  G  Z.  We  have  the  projection  of  X(t)  at  the 
subspace  V3: 

OO 

Xj(t)  Proj3(X(t))  —  ^  ai,k(t>j,k{t)-  (2.13) 

k=— oo 

Now,  we  define  the  detail  (or  residual)  process  of  X(t )  at  level  j  as 


U(i)  :=  Xj-tf)  -  Xj(t). 


(2.14) 


Note  that  Y3{t)  G  W3  by  definition. 

Our  goal  here  is  to  find  a  basis  for  the  subspace  W3 .  It  is  obvious  that 


ao,o  — 


'  — OO 

rl 


X(t)4>(t)dt 

X(t)dt 


(2.15) 


0-1,0  — 


X(f)0_iiO(f)df 
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r  1/2 

=  21/2  /  X(t)</>(2t)dt 

Jo 

=  21/2  /  X(t)dt 

Jo 


0-1,1  — 


=  21/2  /  X(t)(t)(2t  -  l)dt 

J 1/2 

=  21/2  [  X{t)dt 
J 1/2 


and 


Oo,o  — 


0-1,0  +  o_i,i 

“71 


According  to  (2.11),  (2.14)  and  (2.18),  we  have 


Y0(t)  =  X_i(t)-X0(t) 

=  [a_ i,o0-i, 0(£)  +  o_i,i0_i,i  (t)]  —  oq,o0(^) 


0-1,0  —  0-1,1  | 


Ai,0(t)  -  0_i,i(t)] 


We  can  rewrite  Yq(0  =  7),o0(0  for  0  <  t  <  1,  where 


and 


do,o 


0(t)  :  = 


0-1,0  ~~  0-1,1 

72 

0-1,00)  -  0-1, i(t) 

72 

=  0(20  -  0(2 1  -  1) 


(2.16) 


(2.17) 

(2.18) 


(2.19) 

(2.20) 


(2.21) 


can  form  an  orthonormal  basis  for  Wo.  0 ( • )  is  called  the  mother  wavelet  of  the 
MR  A. 

Given  the  scaling  function  0  and  the  corresponding  mother  wavelet  0,  the 
discrete  wavelet  transform  of  the  continuous  time  process  X  (t)  is  formally  defined 
as  follows: 
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Definition  2.1.5  Discrete  Wavelet  Transform  [50]  Given  the  scaling  function 
f  and  the  mother  wavelet  fj,  the  approximation  coefficients  a3^  and  detail  coeffi¬ 
cients  dj^  of  the  discrete  wavelet  transform  of  process  X(t)  are  defined  as  follows 


where 


^j,k 


'  — oo 
roo 


X(t)(f>j>k(t)dt 

X(t)^jtk(t)dt, 


(f>j,k(t)  ■=  2  j/20(2  H  -  k) 
■=  2~j/2^(2  ~3t-k). 


(2.22) 

(2.23) 


(2.24) 

(2.25) 


The  functions  (b3j-  and  t[3j-  form  an  orthonormal  basis  for  V3  and  W3 ,  respectively. 
The  X(t)  has  the  following  representation 

OO 

*w  =  E«  oo,fc0oo,/c(^)  EE  (2.26) 

k  j=— oo  k 

This  simple  example  we  just  introduced  is  called  ‘Haar  wavelets’. 


Definition  2.1.6  Haar  Wavelet  [56]  If 


and 


f{t)  = 


1, 

0, 


( 

i]{t)  =  < 


1, 

-1, 

0, 


if  0  <  t  <  1, 
otherwise, 

if®<t<\, 
if\<t<  1, 
otherwise, 


then  if  is  an  orthonormal  wavelet  for  L2(M).  This  is  called  the  Haar  wavelet  and 
{'f’j'k  ■  j,k  G  Z}  is  an  orthonormal  system  in  L2(M)  and  is  shown  in  Figure  2.2. 
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Figure  2.2:  Scaling  function  (f>(t)  and  mother  wavelet  if(t)  of  Haar  wavelets 


Remark  2.1.7  Since  <f>j,k(t)  and,  ifj} kit )  of  Haar  wavelets  are  orthonormal,  the 
coefficients  a^k  cmd  dj ^  have  the  following  relation: 


%-l,2fc  +  Oj  — l,2fc+l 

a‘*  ~  y? 

A  _  a>j-  1,2k  —  %-l,2fc+l 

aj,k  ~  J= 


(2.27) 

(2.28) 


Theoretically,  the  projection  can  be  performed  from  j  =  — oo  to  oo.  However,  in 
practice  we  have  a  finite  index  from  j  =  0  to  J  and  we  only  consider  the  subspaces 

Vj  c  H/_i  c  ...  c  VQ. 

For  a  discrete  time  process  X%,  i  =  0,1,2,...,  the  discrete  wavelet  transform 
can  be  implemented  by  the  fast  pyramidal  algorithm  [60]  [61].  To  understand 
the  behavior  of  the  traffic  X%,  we  are  more  interested  in  the  detail  process  of  the 
discrete  wavelet  transform  dj In  the  next  section,  we  briefly  introduce  the  long- 
range  dependence  property  of  Internet  traffic  and  its  self-similar  behavior. 
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2.2  Self-Similarity  and  Long-Range  Dependence 


It  is  well  known  that  Internet  traffic  is  self-similar  and  has  a  long-range  dependent 
property  [9]  [62]  [10].  Many  studies  [63]  [64]  have  also  shown  that  this  long-range 
dependence  property  indeed  plays  an  important  role  in  network  performance. 

Let  T  be  either  M,  M+  or  {t  :  t  >  0}. 

Definition  2.2.1  Self-Simiarity  [65]  The  real-valued  process  X(t)  is  self-similar 
with  index  H  >  0  if  for  alia  >  0,  the  finite- dimensional  distributions  of  {X(at),t  G 
T}  are  identical  to  the  finite- dimensional  distributions  of  {aHX(t),t  G  T};  i.e.,  if 
for  any  k  >  l,  ti,  t2,  ■■■■,tk  G  T  and  any  a  >  0, 

[X(at1),X(at2),.,.,X(atk)]  =  [aHX(h),  aHX(t2), ..,  aHX(tk)].  (2.29) 

Where  X  =  Y  denotes  that  r.v.  X  and  Y  have  identical  distributions. 

Definition  2.2.2  Long-Range  Dependence  [66]  A  stationary  finite-variance 
process  Xt  displays  long-range  dependence  with  parameter  a  if  its  autocovariance 
function  R{k)  :=  E[(Xi  —  EXf){Xi+k  —  EXi+k)\  satisfies 

R(k )  ~  Crk a~1  as  k  oo,  (2.30) 

where  0  <  a  <  1  and  Cr  is  a  positive  constant. 

This  also  implies  that  the  corresponding  spectral  density  S( u)  :=  5X{R{k)} 
satisfies 

S(u)  ~  Cf\u\~a  as  uj  — ■>  0,  (2-31) 

where  C*  —  ttfti — r^T — ttt  and  T  denotes  the  Gamma  function. 

J  2Y{l—a.)sinyKOL/2) 

Leland  et  al.  [62]  have  indicated  that  the  Internet  traffic  is  long-range  depen¬ 
dent  and  has  a  self-similar  behavior  with  Hurst  parameter  0.5  <  H  <  1.  The 
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Internet  traffic  observed  at  different  time  resolutions  has  similar  statistical  prop¬ 
erties  and  this  phenomenon  cannot  be  modeled  well  by  traditional  traffic  models 
such  as  Poisson  and  Markovian  processes.  A  well-known  mathematical  process  for 
modeling  a  self-similar  process  is  the  so-called  fractional  Brownian  motion  process. 

Definition  2.2.3  fractional  Brownian  motion  [29]  The  fractional  Brownian 
motion  {BH(t),t  €  M}  is  a  Gaussian  process  with  zero  mean  and  autocovariance 
function: 

Cov(BH(ti),  BH(t2 ))  =  \{\h\2H  +  \h\2H  -  |*i  -  t2\2H}VarBH(l).  (2.32) 

Remark  2.2.4  A  fractional  Brownian  motion  with  VarBn(  1)  =  1  is  called  stan¬ 
dard  fractional  Brownian  motion. 

According  to  definitions  2.2.1  and  2.2.2,  it  is  obvious  that  the  fractional  Brow¬ 
nian  motion  is  self-similar  and  has  a  stationary  increment.  The  increment  process 
of  fractional  Brownian  motion  is  called  fractional  Gaussian  noise.  In  Chapter  3, 
we  introduce  the  properties  of  fractional  Gaussian  noise  process  and  compare  it 
with  our  traffic  model. 

There  are  many  estimation  methods  proposed  to  estimate  the  Hurst  parameter 
of  a  self-similar  process  such  as  R/S  analysis,  variance  time-plots,  periodogram 
analysis,  Whittle  estimator  [62],  [67].  However,  a  self-similar  process  is  strongly 
autocorrelated  and  displays  a  long-range  dependence.  This  long-range  dependent 
property  results  in  a  severe  estimation  bias  and  a  difficulty  in  estimator  conver¬ 
gence.  Wavelet  analysis  is  able  to  avoid  this  problem  by  choosing  the  scaling 
function  (and  the  corresponding  wavelet  function)  properly  [68]. 

According  to  the  wavelet  construction  [56],  the  mother  wavelet  i[)(t)  is  a  band¬ 
pass  function  between  and  uj2  in  the  frequency  domain,  where  oj\  and  u2  are 
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the  lower  and  upper  cut  off  frequency  of  Therefore,  the  detail  coefficient  dj ^ 
can  be  treated  as  the  output  process  of  the  corresponding  bandpass  filter.  The 
square  of  the  detail  process  d2k  roughly  measures  the  amount  of  energy  around 
the  time  t  =  2J/cA  and  the  frequency  2_Ju;o,  where  A  is  the  unit  time  interval  and 


.  . _  UJ1+U2 

^0  •-  —3—  ■ 


Proposition  2.2.5  [66]  If  a  stationary  finite-variance  process  A*  has  long  range 
dependence  with  parameter  a,  then  the  corresponding  detail  coefficients  dhk  have 
the  following  property: 

E[d\]  «  [  S(io)\^{2-j/2'i[(2-H)}\2duj 


'  2~i u>2 


'  2~i bj\<\ixi\<2~i (jJ2 


S(uj)2j\^(2juj)\2duj 

Cf\uj\-a2j\^(2juj)\2duj 


=  2  jaC 


f 


\oj\  a\'i>(yu)\2du 


'  U\<\uj\<UJ2 

=  2  jaCfC(a,iP). 


(2.33) 


Note  that  CfC(a,,i[ )  is  independent  of  the  variable  j. 

This  property  suggests  that  the  parameter  a  can  be  estimated  by  the  slope 
of  the  log2E[d2]  v.s.  j  plot.  This  plot  is  named  the  Logscale  diagram.  One 
advantage  of  wavelet  analysis  is  that  even  when  the  original  process  X,;  has  long 
range  dependence,  its  wavelet  transform  dj ^  still  has  short  range  dependence  if 
the  number  of  vanishing  moments  N  of  the  mother  wavelet  ip{t)  is  chosen  large 
enough  ( N  >  a/2). 


Definition  2.2.6  [69]  The  number  of  vanishing  moments  N  of  the  mother  wavelet 
i/j(t)  is  defined  as: 


Jtkip(t)  =  0,  k  =  0,1,2,...,X-  1. 


(2.34) 
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Proposition  2.2.7  [57]  [66]  If  the  number  of  vanishing  moments  N  >  a/2,  then 
dj:k  is  stationary  and  no  longer  exhibits  long  range  dependence  but  only  short  range 
dependence,  i.e.,  dj ^  is  quasidecorrelated  [70]  to  each  other. 

E[dj,kdj>,k/\  ~  |2 jk  -  2j'k'\a-l-2N ,  (2.35) 

as  23 k  —  23'k'\  — >  oo,  where  j  ^  j'  and  k  ^  k' .  This  implies  the  higher  N  is,  the 
smaller  the  correlation. 

In  order  to  estimate  the  parameter  a  of  long  range  dependence,  one  may  apply 
linear  regression  to  estimate  the  slope  of  the  Logscale  diagram. 

Definition  2.2.8  Weighted  Linear  Regression  [71] 

Given  a  sequence  of  independent  variables  ( Xj,yj ),  j=l,2,...,  the  hypothesis  of  linear 
regression  is  Ey.j  =  bx3  +  a.  The  unbiased  estimator  ( b,a )  of  (5,  a)  is 


r  _  T,  yj(Sxj  -  Sx)/a] 

CO  _  C2 

xx  & X 

Y  Uj{Sxx  -  SxXj)/a 2 

( 7  —  - — 

cc  _  02 

(2.36) 

where 

s  = 

sx  =  J2xi/ai 

(2.37) 


and  a]  is  an  arbitrary  weight. 

If  yj  are  mutually  independent  then  the  covariance  matrix  is 
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Cov(b,  a)  = 


-sx 


-sx 


SSXX-SZ  ssxx-sz 


and  the  correlation  coefficient  is  r  = 


-sx 

Vss^ 


Note  that  the  minimum  variance 


unbiased  estimator  ( MVUE )  is  achieved  if  we  set  cr2=Var(yj). 

Note  that  the  Logscale  diagram  is  a  log-log  plot  and  the  logarithm  is  not  a 
linear  operation 


E\og2(d2)  ±  log 2(Ed2)  =  ja  +  log2(CfC{a,i/>)),  (2.38) 

where  d2  :=  J2k=i  dj,k-  We  are  n°t  aPP^y  tffis  linear  regression  directly. 

Veitch  and  Abry  [68]  [51]  [55]  developed  an  asymptotically  unbiased  and  efficient 
joint  estimator  for  the  parameter  a  and  C(a,i/>).  They  also  provide  a  closed- form 
expression  for  the  covariance  matrix  of  the  estimator  and  show  its  accuracy.  In 
their  work,  y3  is  rewritten  as 

Vj  ■=  l°g2{d2)  —  Qj-, 


where 


9j  ■=  ^(nJ-/2)/ln2-log2(ni/2) 

The  regression  problem  becomes  Ey.j  =  ja  +  log2  CfC(a,  ip).  The  estimator  of 
slope  a  is  obtained  by  performing  the  above  weighted  linear  regression  problem 
with  Xj  =  j  and  the  weight  a2  =  Var{yj). 

Figure  2.3  is  the  Logscale  diagram  of  a  real  traffic  trace  from  the  bottleneck 
router  of  DirecPC  system.  The  total  length  of  this  trace  is  about  one  hour  with 
minimum  time  resolution  one  millisecond.  The  slope  at  large  time  scales  is  a 
constant  which  demonstrates  that  the  real  traffic  is  self-similar  (monofractal)  at 
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large  time  scales.  However,  the  slope  at  small  time  scales  has  severe  different  slope. 
It  indicates  that  the  traffic  is  a  multifractal  process. 

The  Logscale  diagram  not  only  demonstrates  the  long  range  dependence  of 
traffic  but  also  extracts  the  second  statistics  at  every  time  scales.  We  will  employ 
this  property  of  wavelet  analysis  to  provide  an  efficient  algorithm  for  predicting 
queuing  behaviors. 
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2.3  Monofractal  and  Multifractal  Processes 

The  basic  idea  of  multifractal  analysis  is  from  the  large  deviation  principle  (LDP). 

Theorem  2.3.1  Large  Deviation  Principle  [25]  [72]  Let  be  an  arbitrary 
sequence  of  random  variables  on  a  sequence  of  probability  spaces  with  probability 
Pn,  and  let  an  — >  oo.  Assume  that  the  following  limit  exists 

— -logEn[exp(qZ{n))}  ->  c(q)  (2.39) 

and  c(q )  is  finite,  concave  and  differentiable.  Then 

—  log Pn[—Z(n)  c*(a)  :  =  inf (qa  -  c(q)), 

dn  &71  q 

as  A  {a} 

where  En  is  the  expectation  w.r.t.  Pn. 

Consider  a  normalized  cumulative  traffic  Y(t),  0  <  t  <  1,  with  Y(0)  =  0  and 
T(l)  =  1-  Let  Xkn  :=  Y{(kn  +  l)2“n)  -  Y(kn 2~n),  kn  =  0,  l,...,2n  -  1,  denote 
the  normalized  traffic  increment  in  the  interval  [kn2~n,  ( kn  +  l)2~n).  The  LDP  is 
employed  by  defining  Z^J  :=  log Xkn  and  an  :=  n  log  2. 

c(q)  ■■=  lim  1  log  En[exp(qZ(n))\ 
n—>oc  n  log  2 

2n  — 1 

~  -ff  log2{^  5]  (expf^;^)9} 

kn=  0 
2n  — 1 

=  vlog2{^^A"»}’  (2-40) 

kn=  0 

as  n  — >  oo.  If  the  c(q )  exists  and  is  differentiable  for  all  gel.  According  to  the 
LDP,  we  have 

1  1q  #{kn  =  0, 1, ..,  2ra  -  1  :  a  -  e  <  akn  <  a  +  ej 
n  log  2  2n 
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1 

n 


log2  P[ati  G  [a 


—  e,a  +  £]]  — >  c*(a ) 
as  n  — >  oo,  £  — >  0, 


where 

akn  ■■= —log2Xkn.  (2.41) 

n 

In  other  words,  the  probability  of  Xkn  «  2”n“  is  approximately  equal  to  2nc*("b 
Note  that  c*(a)  is  a  non-positive  real  function  and  the  multifractal  spectrum  /(a) 
is  defined  as: 


/(a)  :=  1  +  c*(a), 


(2.42) 


and  2n-h“)  is  interpreted  as  the  “frequency”  of  a  certain  value  of  the  Holder  (or 
called  singularity)  exponent  a.  The  idea  is  easy  to  understand  by  the  following 
interpretation  [50].  Define  the  Partition  Function 


T{q) 


2n  — 1 


lim  - log2  E  Xl 

n— >  oo  — n  '  Kn 


kn= 0 


-1  +c(q). 


(2.43) 


We  have 

2n— 1 

V  xl  >  V  X,9  «  V  (2~n“)9 

/  v  rvn  /  -/  «vn  /  j  x  7 

kn=0  a.i~a  ai~a 

r a  2nf(a')  (2~qna)  =  2_n('?a_4(«)) 


(2.44) 


Take  the  logarithm  on  both  sides,  T(g)  <  qa  —  f(a )  and  /(a)  <  qa  —  T(q).  Since 
this  holds  for  all  a  and  q,  we  have 


T{q)  <  inf  (qa  -  f(a)) 

a 

f(a)  <  inf  (qa  -  T(q)). 
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(2.45) 

(2.46) 


Thus  the  multifractal  spectrum  f(a)  can  be  obtained  by  the  Legendre  transform 
of  T(q).  This  method  is  called  “increment-based  multifractal  analysis”. 

In  order  to  have  the  numerical  advantage  of  the  wavelet  analysis  [27]  [73], 
one  may  define  the  “wavelet-based  multifractal  analysis”  by  analyzing  the  detail 
process  dj.k  instead  of  the  increment  X^n  in  (2.41).  Define 

an(t)  ’■=  —  log2(2n/2|d-n)fc|).  (2.47) 

—n 

It  is  shown  [74]  that  under  some  mild  conditions,  this  approach  captures  the  same 
behavior  of  the  multifractal  spectrum.  Figure  3.6  shows  the  wavelet-based  multi¬ 
fractal  spectrum  of  a  real  traffic  trace.  We  will  apply  this  powerful  tool  to  evaluate 
our  model. 
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Chapter  3 


Multilevel  ON/OFF  Model 

3.1  Motivation 

According  to  wavelet  analysis  of  real  Internet  traffic  trace,  the  Logscale  diagram  in 
Fig.  2.1  is  shown  to  be  monofractal  (self-similar)  at  large  time  scales.  However,  the 
traffic  behavior  at  small  time  scales  is  more  complicated  and  regarded  as  multifrac¬ 
tal.  The  well  known  ON/OFF  model  proposed  to  model  the  connection  duration 
and  user  think  time  is  unable  to  explain  this  multifractal  behavior  at  small  time 
scales.  Since  the  traffic  behavior  at  small  time  scales  plays  an  important  role  in 
affecting  connection  performance  such  as  packet  loss  and  throughput,  we  are  mo¬ 
tivated  to  offer  a  physical  model  that  can  precisely  capture  the  traffic  behavior  at 
all  time  scales  and  explain  the  relationship  between  network  parameters  and  con¬ 
nection  performance.  However,  one  has  to  first  understand  the  network  dynamics 
at  small  time  scales. 

There  are  many  protocols  designed  and  deployed  on  the  Internet  for  various 
purposes.  Nevertheless,  the  traffic  driven  by  TCP  (Transmission  Control  Proto¬ 
col)  has  already  dominated  the  entire  network  for  decades.  TCP  has  a  well  known 
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control  mechanism  for  reliable  communication  and  congestion  avoidance  [75].  In 
order  to  avoid  congestion  at  a  bottleneck  router,  the  burst  size  (size  of  pack¬ 
ets)  is  limited  by  the  current  size  of  the  congestion  control  window.  TCP  de¬ 
termines  the  window  size  according  to  its  current  state  and  packet  loss  events. 
The  window  size  update  algorithm  depends  on  TCP  versions  such  as  Reno  and 
Tahoe.  For  example,  the  window  size  of  TCP/Reno  has  a  small  initial  value 
(1MSS  maximum  segment  size)  and  increases  its  size  by  one  MSS  after  receiving 
an  acknowledgement  from  the  receiver.  This  phase  is  called  “slow  start”.  TCP 
leaves  the  slow  start  phase  and  enters  the  congestion  avoidance  phase  if  TCP 
encounters  a  packet  loss  or  its  windows  size  is  greater  than  a  parameter  called 
ssthresh.  The  default  value  of  ssthresh  is  64 KBytes  and  its  value  is  updated  by 
min{cur  rent -congestion  zwindow  size ,  receiver  -window  size)  /  2.  TCP  interprets 
a  packet  loss  event  as  an  indication  of  network  congestion.  In  the  congestion 
avoidance  phase,  TCP  slowly  increases  its  window  size  by  one  packet  with  every 
round-trip  time  and  decreases  its  window  size  to  a  half  when  detecting  a  new 
packet  loss. 

We  consider  a  typical  web  traffic  transported  over  the  Internet.  Since  most 
objects  in  a  typical  web  page  are  small  graphic  and  text  hies,  the  corresponding 
TCP  connection  usually  spends  most  of  its  life  in  the  slow  start  phase  and  the 
packet  arrival  pattern  is  much  like  an  ON/OFF  process.  TCP  sends  a  batch  of 
packets  during  the  ON  period.  The  OFF  time  is  roughly  equal  to  the  network 
round-trip  time  (RTT).  Note  that  the  original  ON/OFF  process  [16]  models  the 
TCP  session  life  time  and  user  think  time.  In  order  to  model  the  traffic  at  small 
time  scales,  we  propose  a  multilevel  ON/OFF  model  to  mimic  the  operational 
behavior  of  a  typical  Internet  connection. 


Til 


I  I 


T21  T20 


T10 


I  I  I 


Figure  3.1:  Traffic  model  for  one  TCP  session 

3.2  Model  Description 


We  propose  a  two  level  ON/OFF  model  for  a  single  TCP  connection  as  shown  in 
Figure  3.1.  The  upper  level  is  an  ON/OFF  process  that  models  the  TCP  session 
life  time  (Tn)  and  user  think  time  (T10).  In  order  to  capture  the  behavior  of  the 
TCP  mechanism,  there  is  another  ON/OFF  process  inside  the  ON  period  Tn  of 
the  upper  level  ON/OFF  process.  It  imitates  the  burst  arrival  pattern  by  the 
active  time  (duration  of  a  burst  T2 1)  and  inactive  time  (T2 o)  within  the  same  TCP 
session.  The  packet  rate  B  {bytes/ A)  in  T-2\  is  assumed  to  be  a  constant.  Tn 
and  T2 1  have  Pareto  Type  I  distributions  with  parameters  (A'n,  an)  and  [K2\ , 
a2i)  respectively.1  The  user  think  time  Tio  and  inactive  time  T2 o  are  chosen  to  be 
Exponential  random  variables  with  mean  1/Aio  and  1/A2q  respectively.  All  these 


1The  Pareto  Type  I  distribution  function  is  as  follows: 


Pr[T  >  t] 


( K/t)a  ,  if  t  >  K 
l  ,  if  t  <  K 
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random  variables  are  statistically  independent  to  each  other. 

T,o  :=  r.v.  Exp  (1/X10) 

Tu  r.v.  Pareto  (Ku,  an) 

T20  :=  r.v.  Exp  ( 1/A20) 

T21  :=  r.v.  Pareto  (K2i,a2i) 

B  :=  Data  rate  within  the  active  period  {bytes/ A) 

IV  :=  Number  of  connections 

The  synthetic  traffic  will  be  generated  by  aggregating  N  independent  multilevel 
ON/OFF  processes  with  burst  rate  B,  which  will  be  formulated  in  (3.1).  ft  is 
desired  to  generate  synthetic  Internet  traffic  traces  that  match  the  real  Internet 
traffic  traces  as  measured  by  the  degree  of  matching  in  the  Logscale  diagram  and 
in  the  multifractal  spectrum. 

3.3  Parameter  Estimation  and  Model  Fitting 

3.3.1  Trace  Format 

Before  discussing  parameter  estimation  and  model  fitting,  we  briefly  describe  the 
format  of  a  real  Internet  trace  observed  in  a  bottleneck  router.  The  raw  trace 
is  recorded  by  tcpdump ,  which  is  a  network  monitoring  tool  developed  by  the 
Department  of  Energy  Lawrence  Livermore  Lab.  To  estimate  the  parameters  of 
the  proposed  traffic  model,  we  need  the  following  information: 

•  Timestamp:  the  arrival  time  of  a  packet 

•  Packet  size:  the  length  of  a  packet 

•  Source  address:  source  host  IP  address 
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•  Destination  address:  destination  host  IP  address 


•  Source  port:  source  TCP  port  number 

•  Destination  port:  destination  TCP  port  number 

•  TCP  flag:  indicate  the  SYN,  SYN-ACK,  and  FIN  packet 

One  can  extract  every  TCP  connection  by  the  source-destination  pair  (Source 
IP-address.port,  Destination  IP-address.port).  The  aggregate  traffic  X,  is  collected 
by  the  time  stamp  and  the  corresponding  packet  size  that  belongs  to  the  ith  time 
interval.  The  time  interval  between  the  SYN  and  FIN  packets  of  a  certain  TCP 
session  is  defined  as  the  session  life  time.  Similarly,  the  time  interval  between 
SYN  and  SYN-ACK  packets  gives  a  measure  of  the  round-trip  time.  From  those 
observations,  we  are  able  to  estimate  the  following  statistics: 

•  Mean  traffic  rate  ( EXt ) 

•  Autocorrelation  function  R{k ) 

•  Logscale  diagram  Lj 

•  Mean  round-trip  time  (ET20) 

•  Mean  session  time  (ET\  1 ) 

Given  a  real  Internet  trace,  our  goal  is  to  estimate  the  corresponding  model  pa¬ 
rameters,  generate  a  synthetic  traffic  trace  from  the  model  and  demonstrate  that 
the  synthetic  traffic  has  similar  statistical  properties  and  queuing  behaviors  with 
the  real  traffic  trace. 
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3.3.2  Parameter  Estimation 


According  to  the  definition  of  the  multilevel  model,  the  aggregate  traffic  can  be 
written  in  terms  of  the  sum  of  i.i.d  indicator  functions. 


Definition  3.3.1  Assume  the  multilevel  ON/OFF  process  is  already  in  the  steady 
state  before  time  0.  Let  Uk(t )  and  14(f)  denote  the  indicator  functions  defined  as: 

Uk(t)  :=  1  {kth  connection  is  in  ON  state  at  t} 

14(f)  :=  1  {kth  connection  is  in  Active  state  at  t}. 

The  aggregate  cumulative  byte  count  of  the  multilevel  ON/OFF  process  is 

pt  N 

Y(t)  —  B  /  Y^Uk(u)Vk(u)du.  (3.1) 

Jo  k= i 

The  single  ON/OFF  process  can  be  treated  as  a  special  case  of  the  multilevel 
ON/OFF  process  by  setting  B  —  1  and  14(f)  =  1  for  all  f.  Hence,  the  aggregate 
cumulative  byte  count  of  a  single  level  ON/OFF  process  in  the  interval  [0,  Tt)  is 
defined  as: 

f-Tt  N 

Y  (Tt)  =  /  V4(«)d«.  (3.2) 

k= i 

Taqqu,  Willinger  and  Sherman  proved  the  following  theorem: 


Theorem  3.3.2  [16]  For  large  N  and  T,  the  aggregate  cumulative  process  {Y(Tt),t  > 
0}  behaves  statistically  like 


TN- 


ET 


n 


ETn  +ET \ 


10 


-f  +  THVNaiimBH(t), 


(3.3) 
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where  the  Hurst  parameter  H  —  (3  —  an)/2  and  Bn(t)  is  the  standard  fractional 
Brownian  motion. 

In  this  theorem,  the  relationship  between  the  Hurst  parameter  H  (index  of 
self-similarity)  and  shape  parameter  an  (index  of  Pareto  distribution)  had  been 
proved.  This  theorem  explained  that  the  self-similarity  of  Internet  traffic  is  mainly 
due  to  the  heavy-tailed  distribution  of  hie  sizes  which  are  typically  transmitted 
over  the  Internet. 

Since  the  lower  level  ON/OFF  process  only  exists  in  the  ON  period  of  the  upper 
level  process,  one  can  easily  have  the  following  relationship  between  Y(t)  and  Y(t): 


Proposition  3.3.3  If  max(ET2i,  ET2o)  «  ETn,  then 

lim  X^-=E[V)B.  (3.4) 

As  time  t  — >  oo,  the  aggregate  cumulative  traffic  of  the  multilevel  ON/OFF 
process  Y(t)  is  statistically  like  the  fractional  Brownian  motion. 

In  order  to  match  the  second  order  statistical  properties  of  the  real  traffic, 
we  have  to  estimate  the  model  parameters  from  the  real  trace.  Instead  of  dealing 
with  the  cumulative  process  Y(t),  we  define  the  increment  process  Xt  of  the  traffic. 


Definition  3.3.4  The  increment  process  ofYft )  is  defined  as 

Xi  :=  Y(fi  +  1)A)  -  Y (*A),  i  =  0,1,  2,  3, ...  (3.5) 

where  A  is  the  minimum  time  resolution  of  interest.  The  increment  process  X.t  is 
interpreted  as  the  total  bytes  that  arrive  in  the  interval  [iA,  (i  +  1)A). 
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Remark  3.3.5  If  B #(t)  is  a  fractional  Brownian  motion  (FBM),  the  increment 
Gi  :=  Bn((i  +  1)A)  —  Bn(i A)  is  a  stationary  sequence  called  fractional  Gaussian 
noise  (fGn). 


Proposition  3.3.6  [65]  The  fractional  Gaussian  noise  Gi  has  the  autocovariance 
function 

RG(k )  =  ^-(\k  +  l\2H -2\k\2H +  \k-l\2H),  keZ 

~  c TqH{2H  —  1  )k2H~2,  as  k  — >  oo  (3.6) 

where  a g  =  Par[Gj]. 

From  (Al)  and  (3.5),  it  is  obvious  that  the  autocovariance  function  of  Xi  has 
the  same  form  as  (3.6)  as  k  — >  oo.  Recall  that  the  introduction  of  wavelet  analysis 
and  equation  (2.30)  suggest  that  the  Logscale  diagram  of  the  process  Xt  has  a 
slope  a  =  2 H  —  1  =  2  —  an  at  the  large  time  scales.  Veitch  and  Abry  [68]  provided 
an  asymptotically  unbiased  and  efficient  estimator  for  the  slope  a  in  the  Logscale 
diagram  within  a  certain  region.  We  will  apply  this  method  to  estimate  the  model 
parameter  an  through  the  estimation  of  slope  a  at  large  time  scale  region. 

Similarly,  we  consider  the  traffic  X,  at  small  time  scale.  Linder  the  assumption 
that  Tu  »  T'2\  and  T2 o,  the  traffic  behavior  at  small  time  scale  is  dominated  by 
the  lower  lever  ON/OFF  process  V(t).  When  we  observe  the  upper  level  ON/OFF 
process  U^ft)  with  a  small  time  scale,  £4(f)  behaves  like  a  constant.  We  may 
rewrite  limt^,0  P(t)  as: 

rt  N 

limF(f)  =  lim B  /  'S^Uk{u)Vk{u)du 

t  >0  t  >0  /  r\ 

k=  1 

N  ft 

«  Bj^Uki 0)  lim  /  Vk(u)du.  (3.7) 


34 


It  is  another  single  (lower)  level  ON/OFF  process  from  the  point  view  of  small 
time  scale.  Based  on  the  structure  of  the  multilevel  ON/OFF  model,  there  is 
another  linear  region  in  the  Logscale  diagram  in  the  small  time  scale  region.  It 
is  contributed  by  the  lower  level  ON/OFF  processes.  One  might  use  the  same 
technique  to  estimate  the  model  parameter  021  in  the  small  time  scale  region. 

The  parameter  Ku  of  Tn  is  estimated  by  matching  the  first  moment  of  the 
session  life  time. 

Ku  ■=  ————ETu  (3.8) 

a  11 

Unlike  estimating  Ku  by  matching  the  mean  of  session  life  time  Tfi,  there 
is  no  control  packet  in  the  real  trace  indicating  the  start  and  end  of  each  active 
period  T2\.  As  mentioned  before,  we  assume  that  the  upper  ON/OFF  process 
always  keeps  its  state  when  we  observe  the  process  with  a  small  time  scale.  The 
parameter  A'21  is  estimated  by  the  normalized  autocorrelation  function  Rn(t)  of  Xj. 


Definition  3.3.7  Let  R(t)  be  the  autocorrelation  function  of  the  multilevel  ON /OFF 
process.  When  t  — >  0,  we  assume  that  the  function  U (t)  is  a  constant  function  and 
have  the  following  approximation: 

R(t)  «  N(AB)2E[U2(0)]E[V(0)V(t)},  as  t  -»•  0. 


We  also  define  7in(t)  :=  Pr[V(t )  =  1|V(0)  =  1]  and  the  normalized  autocorrelation 
function  Rn(t): 


Rn(t ) 


R(t)  _  E[V(0)V(t)] 
R( 0)  ~  E[V2( 0)] 

Pr[V{t)  =  1,U(0)  =  1] 
Pr[V  (0)  =  1] 

7rn(f),  as  t  0. 


(3.9) 


35 


Theorem  3.3.8  [16]  [76]  Assume  that  the  ON/ OFF  process  V(t)  is  stationary, 
the  renewal  equation  for  7Tn  (t)  is 

vrii(t)  =  Gic(t)  +  f  Fic(t  -  u)dHw(u),  (3.10) 

Jo 

where  Gic{t):  =  Pr[residual  life  of  the  first  ON  interval  >  t  \  at  time  0  is  ON\, 
Ficft)  :=  Pr [T2 1  >  t]  and  Hi0(u )  is  the  renewal  function  corresponding  to  the 
inter-renewal  distribution  F{  *  F0.  ( Hw  =  Y^h=i(Ei  *  Eo)*k>  where  *  denotes  the 
convolution  and  F/t),  i  =  1  and  0,  is  the  CDF  of  ON  and  OFF  time  respectively.) 


Since  we  are  interested  in  the  behavior  of  Rn(t)  around  t  =  0,  we  have  the 
approximation 

Rn{t)  «  Glc[t ),  as  t  0. 

Given  that  the  T2 1  has  Pareto(K2i,  a2i)  distribution,  we  have 


Gic(t )  — 


,  I\ 


21 


021 


ET21  Jt  v  u 
021  1 


du 


-t 


—021+1 


Q'21 


The  parameter  K2\  can  be  estimated  by  the  following  estimator: 


K21  :=  A(a21i?„(A))1/(a21”1\  as  A  -  0, 


where 

A)  :=  £vvi+1/£v,2. 

i  i 

The  empirical  results  show  that  Rn{t )  and  G\c(i)  have  a  good  match  when  t  is 

small. 
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To  estimate  the  parameter  I/A20,  or  equivalently  the  mean  inactive  period,  we 
need  to  measure  the  network  round-trip  time  from  the  trace.  It  can  be  extracted 
from  the  real  trace  by  the  duration  between  the  SYN  packet  and  the  SYN-ACK 
packet  at  the  beginning  of  each  TCP  session.  In  our  model,  the  round-trip  time  is 
equal  to  the  lower  level  OFF  time  T20.  We  have  the  mean  of  T20: 

1/  A20  =  et20. 

The  parameter  B  is  the  constant  data  rate  in  the  active  period  T21.  With  the 
assumption  of  independent  connections,  we  have  the  following  proposition: 

Proposition  3.3.9  Let  R\  :=  et^+eTiq  ani ^  ^ 2  :=  et^l+et2o  >  we  ^ ave 


EXi 

=  NABR^-2 

(3.11) 

EX * 

=  Y(A5)2i?i/?2. 

(3.12) 

and 

B  EXf 
A  EXi  ' 

Thus  the  estimator  B  is  defined  as 

6. 

A  Y^i  xi 

Equation  (3.11)  implies  that  there  is  one  degree  of  freedom  to  choose  N  and 
i?i(or  ETi 0  equivalently).  In  order  to  satisfy  the  assumption  in  the  theorem  3.2, 
one  needs  to  select  a  large  integer  for  N  so  that  the  mean  OFF  time  I/A10  can 
be  determined  by  Ri  in  (3.11).  Since  T10  is  an  Exponential  random  variable  and 
ETW  »  ET\  1 ,  the  starting  time  of  each  TCP  session  can  also  be  approximated 
by  a  Poisson  process  as  N  — >  00. 
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Figure  3.2  shows  the  network  topology  and  bottleneck  router  (Satellite  Gate¬ 
way)  of  the  DirecPC  system,  of  Hughes  Network  systems.  We  measure  the  model 
parameters  from  the  downlink  traffic  (from  Internet  to  user)  and  generate  a  syn¬ 
thetic  traffic  trace  by  our  model  and  the  corresponding  model  parameters.  At 
least  ten  very  long  traces  (more  than  a  hour)  with  a  high  time  resolution  (one  mil¬ 
lisecond)  are  analyzed  and  compared  by  our  model.  For  example,  one  real  trace 
recorded  on  17:00-18:00  Oct.  13  1999  is  analyzed  and  compared  by  the  corre¬ 
sponding  synthetic  traffic  trace.  According  to  the  previous  parameter  estimation 
methods,  we  have  the  following  results.  The  mean  round-trip  time  and  the  mean 
TCP  session  time  are  0.130sec  and  4.896sec  respectively.  The  aggregate  trace 
has  a  mean  rate  513.98 bytes /msec  and  variance  9.8990  x  10 5bytes2 /msec.  The 
normalized  autocorrelation  function  Rn(lmsec)  is  0.3519.  The  shape  parameters 
an  and  0-21  are  estimated  by  the  slopes  of  the  Logscale  diagram  in  the  regions  of 
small  and  large  time  scales  as  shown  in  Figure  2.3  and  3.3.  Table  3.1  provides  the 
corresponding  parameters  of  this  model. 

With  our  model  and  the  estimated  model  parameters,  we  generate  the  synthetic 
trace  by  the  constant  bit  rate  assumption  within  the  lower  level  ON  duration  T21 
of  our  model  (within  the  period  of  T2 1,  packets  are  generated  every  one  millisecond 
with  packet  size  B  bytes  ),  as  shown  in  Figure  3.1.  The  aggregate  synthetic  trace 
are  generated  by  aggregating  N  independent  such  connections. 


Table  3.1:  V 

odel  Parameters 

Para. 

Kn 

an 

1/ A10 

B 

Value 

1.27sec 

1.35 

167.55sec 

1926bytes/ms 

Para. 

K21 

0-21 

1/ A  20 

N 

Value 

0.54ms 

1.77 

128.75ms 

1000 
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Figure  3.2:  Topology  of  DirecPC  network  and  the  bottleneck  gateway 

3.4  Second  Order  Statistics 

We  employ  the  discrete  wavelet  transform  to  analyze  the  real  and  synthetic  traffic. 
The  second  order  analysis  of  the  traffic  is  obtained  by  studying  the  detail  process 
of  wavelet  transforms  dj^-  As  mentioned  previously,  the  Xt,  i  =  0,1,2,..  is  the 
time  series  of  total  bytes  transmitted  during  the  interval  [*A,  (i  +  1)A).  In  order 
to  avoid  the  estimation  error  from  the  deterministic  trend,  the  mother  wavelet  of 
the  discrete  wavelet  decomposition  is  chosen  to  be  the  Daubechies  wavelet  with 
the  number  of  vanishing  moments  N  —  3.  The  Logscale  diagram  in  Figure  3.4 
is  the  energy  of  the  detail  process  log2E[d‘jk]  v.s.  the  octave  j  of  the  real  traffic 
and  synthetic  traffic.  It  is  easily  seen  that  the  second  order  statistics  of  these  two 
traffic  traces  have  almost  the  same  values  on  every  scale.  This  match  also  implies 
their  similar  autocorrelation  structures  in  time. 

There  is  a  breaking  point  around  the  scale  j  =  11  (2nA  =  2.048sec)  related 
to  the  minimum  value  of  Tu  (=K n  in  the  model).  When  the  observing  time  scale 
is  smaller  than  2  sec.  in  this  trace,  the  traffic  behavior  is  dominated  by  the  lower 
level  ON/OFF  process  or  equivalently  by  the  TCP  congestion  control  mechanism. 
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The  figure  3.4  shows  that  TCP  dynamics  can  be  modeled  well  in  the  second  order 
behavior  by  a  multilevel  ON/OFF  process.  On  the  right-hand  side  of  the  breaking 
point,  the  behaviors  of  the  real  and  synthetic  traces  are  both  monofractal  with 
the  same  Hurst  parameter  ( H  «  0.823).  Note  that  the  slope  a  and  the  Hurst 
parameter  [66]  have  the  relation  a  =  2 H  —  1. 
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3.5  Higher  Order  Statistics 


We  employ  wavelet-based  multifractal  analysis  to  analyze  real  and  synthetic  traf¬ 
fic  traces.  Unlike  the  Logscale  diagram  which  only  considers  the  second  order 
statistics,  this  multifractal  analysis  method  considers  all  moments  of  the  stochas¬ 
tic  process  in  the  wavelet  domain.  The  higher  order  statistics  are  extracted  by  the 
structure  function  S(q,j)  and  the  partition  function  T(q)  defined  in  [25]  [77], 

S(q,j)  :=  X  l|2-<I-i)/2d/J« 

k= 1 

where  L  :=  /or/2 (Data  Length).  T(q)  is  approximated  by  the  slope  of  log2S(q,j) 
when  j  is  small.  The  multifractal  spectrum  f(a)  is  the  Legendre  Transform  of 

T(q): 

f(a)  :=  inf(«a  -T(q)). 

1 

The  multifractal  spectrum  /(a)  provides  a  measure  of  “frequency”  of  the  singu¬ 
larity  exponent  aft)  at  time  t.  It  indicates  the  probability  of  a  certain  value  of  the 
singularity  exponent: 

Pr[aft )  =  a]  ~  2~L('W(a)) 

For  a  monofractal  process,  like  the  fractional  Gaussian  noise  ( fGn ,  the  increment 
of  fractional  Brownian  motion),  its  singularity  exponent  a(t)  is  a  constant  H  for 
every  t.  This  might  be  considered  as  a  degenerate  case  of  multifractality.  The 
corresponding  partition  function  T(q)  =  qH  —  1  of  fGn  is  a  linear  function  of 
q.  Since  the  aft)  is  equal  to  H  for  every  t  in  fGn,  its  multifractal  spectrum 
should  be  a  single  point  at  (H,  1).  We  will  employ  the  fGn  as  a  pilot  process 
and  compare  the  multifractal  spectrum  with  the  real  and  synthetic  traffic.  For 
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a  multifractal  process,  the  partition  function  is  a  concave  function  of  q  and  the 
singularity  exponent  a(t)  has  a  wide  range  of  values.  In  other  words,  there  is  a 
non- negligible  probability  that  a{t)  is  equal  to  other  values.  Figure  3.5  shows  the 
partition  functions  of  the  real  trace,  of  the  synthetic  trace  and  of  the  fGn.  The 
concave  curves  of  the  partition  function  show  that  the  real  traffic  and  synthetic 
traffic  are  multifractal  processes  and  the  partition  function  of  fGn  is  a  linear 
function  due  to  its  monofractal  behavior.  It  is  much  clearer  to  see  the  difference  in 
their  multifractal  spectra  in  Figure  3.6.  The  spectrum  of  fGn  shows  the  probability 
Pr[a{t)  =  H]  fa  1.  For  the  real  and  synthetic  traffic,  their  spectra  show  a  rich 
variety  of  singularity  exponents  with  a  non- negligible  probability.  Moreover,  the 
spectrum  of  our  model  is  not  only  multifractal  but  also  has  the  same  shape  as  that 
of  the  real  traffic. 
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3.6  Queuing  Behavior 


After  comparing  the  statistical  properties  of  the  real  and  synthetic  traffic,  we  are 
also  interested  in  their  queuing  behaviors.  Figure  3.2  shows  network  topology  and 
bottleneck  router  (Satellite  Gateway)  of  the  real  DirecPC  system.  The  arrival 
traffic  of  bottleneck  router  is  aggregated  downlink  traffic  (from  Internet  to  user), 
which  is  modeled  by  our  traffic  model.  In  order  to  predict  the  queuing  behavior  of 
the  bottleneck  router,  we  consider  a  simple  first-come-first-serve  queuing  system 
(FIFO)  with  a  fixed  service  rate  and  an  infinite  buffer  size  as  shown  in  Figure 
3.7.  Given  the  mean  rate  of  arrival  traffic,  the  queue  length  distributions  under 
different  utilization  are  obtained  by  properly  adjusting  the  service  rate.  Figure 
3.8  shows  the  steady  state  queue  length  tail  distributions  Pr[Q  >  x]  with  various 
values  of  utilization  p  =  0.6,  0.7,  0.8  and  0.9.  When  the  traffic  load  is  heavy  (large 
utilization),  the  real  and  synthetic  traffic  have  almost  the  same  distributions.  With 
the  light  traffic  load,  the  synthetic  traffic  also  provides  a  good  prediction  for  the 
queue  length  distribution  when  the  queue  length  is  less  than  50K  bytes.  In  the 
region  of  large  queue  lengths,  the  tail  queue  length  distribution  is  overestimated. 
However,  this  event  happens  with  a  small  probability  due  to  the  light  traffic  load. 


To  end  users 


FIFO  queue  in  gateway  with  fixed  bandwidth 

Aggregated  downlink  traffic 


Figure  3.7:  First-come-first-serve  queue  in  satellite  gateway 
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Queue  Length  x  (KBytes) 


Figure  3.8:  Queue  Length  Tail  distribution  of  Real  Trace  and  Synthetic  Traffic 
with  Utilization  p  =  0.6,  0.7,  0.8  and  0.9 


3.7  Summary 


In  this  chapter,  we  propose  a  multilevel  ON / OFF  model  to  capture  the  multifractal 
behavior  of  Internet  traffic.  The  idea  of  this  model  is  to  mimic  the  arrival  pattern 
of  packets  within  a  TCP  section.  The  network  parameters  such  as  round-trip  time 
are  involved  in  the  model.  In  the  next  chapter,  we  address  that  this  model  can 
be  a  useful  tool  to  understand  the  impact  of  network  parameters  on  connection 
performance.  We  develop  a  simple  algorithm  to  estimate  the  parameter  from  a 
real  traffic  trace  and  generate  a  synthetic  traffic.  The  second  order  and  higher 
order  statistics  comparisons  were  performed  by  Logscale  diagram  and  multifrac¬ 
tal  spectrum.  Finally,  we  compare  their  queuing  behaviors  such  as  steady  state 
distribution  by  simulations. 
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Chapter  4 


Performance  Analysis  of  Queuing 
Behaviors 

4.1  Overview 

In  Chapter  3,  we  propose  a  multilevel  ON/OFF  model  for  Internet  traffic.  In 
this  chapter,  we  focus  on  theoretic  results  of  traffic  statistical  properties  and  its 
corresponding  queuing  behavior  such  as  steady  state  queue  length  distribution, 
mean  packet  delay  and  jitter. 

Since  the  Logscale  diagram  comprises  important  statistical  properties  of  the 
traffic,  we  are  looking  for  an  explicit  expression  of  Logscale  diagram  as  a  func¬ 
tion  of  model  parameters.  Section  4.2  contains  an  approximation  of  the  Logscale 
diagram  by  considering  the  partial  integral  with  power  spectrum  density  and  the 
autocorrelation  function  of  the  traffic  model.  This  explicit  expression  of  Logscale 
diagram  will  help  us  understand  the  relationship  between  network  parameters  and 
performance  of  TCP  connections. 

Given  the  Logscale  diagram  and  mean  rate  of  traffic,  we  develop  a  recursive 
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method  in  Section  4.3.  In  this  method  we  apply  the  properties  of  wavelet  analysis 
to  compute  the  variances  of  aggregate  traffic  workload.  This  simple  transforma¬ 
tion  allows  us  to  obtain  the  workload  variance  at  every  time  scale  from  Logscale 
diagram.  Furthermore,  assuming  that  the  traffic  workload  has  a  Lognormal  distri¬ 
bution,  one  can  easily  predict  the  steady  state  queue  length  distribution  by  using 
the  same  technique  provided  in  [78]. 

For  most  of  real  time  applications  such  as  voice  and  video  conference,  packet 
delay  and  jitter  are  important  factors  for  connection  performance.  In  Section 
4.4,  we  discuss  the  delay  and  jitter  of  a  CBR  (Constant  Bit  Rate)  connection  in 
the  bottleneck  queue  with  background  traffic.  The  CBR  connection  has  a  fixed 
packet  interarrival  time  and  fixed  packet  size.  It  is  employed  here  as  a  probing 
process.  According  to  our  prediction  of  steady  state  queue  length  distribution, 
the  mean  CBR  packet  delay  of  a  FIFO  queue  is  obtained  straightforward.  We  also 
develop  a  method  to  provide  a  tight  upper  bound  of  jitter  according  to  the  Logscale 
diagram  of  the  background  traffic.  Both  predictions  of  mean  delay  and  jitter  are 
validated  by  ns2  simulations.  This  fast  algorithm  could  provide  useful  information 
for  network  management  such  as  routing  decision  and  resource  allocation. 

4.2  Analytical  Solution  of  Logscale  Diagram 

We  already  demonstrated  that  the  multilevel  ON/OFF  model  has  a  good  match 
in  their  statistical  properties  and  queuing  behaviors  in  Chapter  3.  A  question 
that  arises  naturally  is  whether  one  can  find  the  relationship  between  network 
parameters  and  connection  performance.  The  first  step  of  our  approach  is  to  find 
an  explicit  expression  of  the  Logscale  diagram  as  a  function  of  network  parameters. 

Recall  the  model  definition  in  Chapter  3,  the  aggregate  cumulative  byte  count 
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of  the  multilevel  ON/OFF  process  is  given  in  Definition  3.3.1  as 

rt  N 

Y(t)  —  B  /  Y^Uk{u)Vk{u)du.  (4.1) 

Jo  k= i 

The  real  traffic  is  modeled  by  the  increment  process  Xt  of  Y(t),  which  is  defined 
in  3.2.4: 


x,  :=  y((*  +  i)A)-y((i)A) 

r(i+ 1)A  N 

=  B  I  Uk(u)Vk{u)duy  i  =  0, 1,2,3, ... 


(4.2) 


'  iA 


k= 1 


Since  Xt  is  the  sum  of  N  i.i.d  process  Zik, 

r(i+  1)A 


Zitk  -=B  Uk(u)Vk(u)du, 

JiA 


(4.3) 


where  k  —  1,  2, N.  It  is  obvious  that  Xi  and  Z*  have  the  same  autocovariance 
structure: 


Cov{Xh  Xi+n)  =  NCov{Z u  Zi+n).  (4.4) 

Our  goal  here  is  to  calculate  the  Logscale  diagram  of  X*,  which  can  be  treated  as  the 
signal  energy  in  a  certain  frequency  range.  In  order  to  obtain  the  power  spectrum 
density  of  the  process  X,-,  we  have  to  calculate  the  corresponding  autocovariance 
Cov(Xi,  Xi+n). 

We  consider  the  lower  level  ON/OFF  process  V{t).  Let  the  random  variables 
T2 1  and  T20  be  the  duration  of  ON  and  OFF  with  mean  fii  and  fi0  respectively. 
The  ON/OFF  process  starts  at  t  =  —oo  and  reaches  the  steady  state  before  t  =  0. 

According  to  the  Definition  3.3.7  and  Theorem  3.3.8,  the  autocovariance  func¬ 
tion  of  V ( t )  in  the  steady  state  is 

r(t)  :=  E[r(0)VV)]  -  (£[r(0)])2  =  —*£—[*•„(«)  -  -A—  ]  (4.5) 

hi  +  ho  hi  +  ho 
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and 


TCnit)  =  Glc(t)  +  f  Fic(t  -  u)dHw(u).  (4.6) 

Jo 

In  order  to  obtain  the  power  spectrum  density  function,  we  first  consider  its  Laplace 
transform  r(s) . 

Proposition  4.2.1  [16]  [76]  The  Laplace  transform  of  irn(t)  is  : 

7rn(s)  =  Gic('S)  +  hio(s)(l  —  fi(s))/s  (4-7) 


where 


and 


m  := 

Gic(s) 


1 


Ficju) 

hi 


du} 


hi 

1 


^f{hi  -  /  Flc{u)du] 


^{hi 


hi 

1  _  1  ~  fi(s) 

S  hi  S2 


fi(x)dx]du} 


0 


hm(s)  = 


(!  -  /.(»))/«(») 


(4.8) 


(4.9) 


(4.10) 


it  i*(i  -  ZiW/oW) 

Note  that  the  fft),  i  =  0, 1  is  the  pdf  of  the  ON  and  OFF  duration  respectively. 
The  Laplace  transform  of  r(t)  is 

hiho  (1  -  A(s))(l  -  fo(s)) 


r„s  = 


(hl+ho)2S  (h!  +ho)52(1  -  fl(s)fo(s)) 


(4.11) 


Case  1:  Exponential  distribution  at  ON  duratoin 

If  the  ON/OFF  periods  T2i  /T2q  have  the  Exponential  distributions  with  mean 
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1/Ai  =  /I \  and  1/Ao  =  /Jo  respectively,  we  have  the  density  function  and  the 
Laplace  transform  of  the  ON  and  OFF  duration  as  follows: 


fi(t)  =  Afexp(— Ai«) 
fi(s)  = 


(4.12) 

(4.13) 


s  +  Aj 

where  i  —  1  and  0.  The  corresponding  r(s),  r(t)  and  the  power  spectrum  density 
S(u)  of  this  Exponential  ON/OFF  process  are 

—  .  HiHo  1 


r  is)  = 


r(t)  = 


(Hi  +  /xo)2  s  +  A 
1 


—A  t 


A(/xi  +  no) 


S(uj)  =  /  r(t)e  ■‘wtdt  = 


(4.14) 

(4.15) 

(4.16) 


(1/Ai  +  1/A0)(ca2  +  A2)’ 
where  A  =  Ai  +  Ao-  With  the  Proposition  2.2.5,  we  have  the  approximation  of 
E[d2k]  by  assuming  that  T(c<j)  is  an  ideal  bandpass  function: 


r7r/2^  1 

E{dlk\  ~  2  /  S(W)2l||>P(2la.)f^ 

Jir/23 

2i+2  7T  /  A  7t/A 

~  — — — - —-(arctan^—r—arctan — — ). 

A(1/A1  +  1/A0)V  2J-1  2J  J 


(4.17) 

(4.18) 


Take  the  logarithm  on  both  sides,  the  Logscale  diagram  of  single  ON / OFF  process 


is 


Lj  :=  log2E[d?k\  «  l°g2{  +  i/a0)  ^arctan^pi  ~  arctan^)}  (4.19) 

Recall  that  there  are  N  independent  connection  with  Burst  rate  B  and  the  upper 
level  has  a  ON/OFF  ratio  R\ .  The  Logscale  diagram  of  the  aggregate  traffic  at 
small  time  scale  region  is  approximated  by 


L, 


Lj  +  log2{N{BRi 


(4.20) 
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Since  both  ON  and  OFF  duration  have  Exponential  distributions,  the  auto¬ 
covariance  function  r(t)  at  (4.15)  also  has  an  exponential  form  which  indicates  a 
short-range  dependence  property.  The  Logscale  diagram  of  this  short  range  de¬ 
pendent  ON/OFF  process  is  shown  in  Figure  4.1  with  the  same  mean  ON  time 
and  OFF  time  estimated  in  Chapter  3.  Note  that  the  zero  slope  of  the  Logscale 
diagram  at  large  time  scales  demonstrates  the  absence  of  long  range  dependence 
and  obviously  the  Exponential  distribution  assumption  of  ON / OFF  duration  is  not 
valid  for  modeling  a  real  traffic.  The  heavy-tailed  type  distribution  such  as  Pareto 
distribution  is  suggested  as  a  candidate  for  modeling  the  long-range  dependence 
traffic  [16]  [1]  [66].  However,  there  is  no  closed-form  for  the  autocovariance  function 
of  such  ON/OFF  process  with  Pareto  distribution.  We  provide  an  approximation 
for  the  Logscale  diagram  of  the  ON/OFF  model. 

Case  2:  Pareto  distribution  at  ON  duration 
The  density  function  of  Pareto(K,  a)  distribution  fp(t)  =  aKat~a~l  can  be  approx¬ 
imated  (truncated  tail)  [79]  by  the  weighted  sum  of  N  density  functions  of  Expo¬ 
nential  distribution  with  mean  yn/z/,  n  =  0, 1, ...,  N  —  1.  Define  r.v.  YN-—Yln=o  Xn, 
where  r.v.  Xn  has  an  Exponential  density  function: 

:=^ex (4.21) 

Then  the  pdf  of  YN  is: 

N-l 

=  G(JV)£S”iDxp(-iio,  (4.22) 

n= 0  '  ' 

where  parameter  6  is  the  weighting  factor.  Note  that  J0°°  fyN(t )  =  1  for  probability 
density  function,  we  have  the  normalization  factor  G(N )  =  jEim  ■ 

Recall  that  the  ath  moments  of  Pareto  distribution  is  diverging  when  a  is 
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greater  or  equal  to  the  shaping  parameter  a.  i.e. 


E[Pa }  = 


tafP(t)dt 


Ik 


=  aK° 


t 


—a—  l-\-a 


dt 


(4.23) 


Ik 


diverges  if  a  >  a.  Hence,  we  consider  the  ath  moments  of  r.v.  YN: 

1  _  (0Y)n 

E[YS\  =  G(N)  -  1  E\X%\. 


Let  N  — >  oo,  we  have 


1  -  6*7° 


lim  E[YZ\  =  1  f  E[XS]. 

N-> oo  1  Ni  1  -  6*7“  1  0J 


(4.24) 


(4.25) 


Since  X0  has  Exponential  distribution,  we  have  E[X g]  <  oo  for  all  a  <  oo.  How¬ 
ever,  if  9rya  >  1,  the  limit  in  (4.25)  diverges.  We  are  free  to  choose  the  scaling 
factor  to  be  7  =  2,  then  the  weighting  factor  9  =  2~a.  The  mean  of  the  mixed 
density  function  is  E[Yjy]  =  G(N )  ®n~V-  In  order  to  match  the  mean  of  the 

Pareto  distribution  aK/(a  —  1),  we  have 

1  —  (Q'y)N  a  —  1 


v  =  G(N)- 


1  -  07  aK 


(4.26) 


The  Pareto  distribution  approximation  by  sum  of  weighted  Exponential  distribu¬ 
tions  are  shown  in  Figure  4.2. 
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t 


Figure  4.2:  Probability  density  function  of  Pareto  and  sum  of  weighted  Exponential 
distributions  with  7V=1,2,...,30. 


The  Laplace  transform  of  fyN(s )  is: 


N—l  ,  n 

(4.27) 

77=0  ^  '  ' 

Substitute  fo(s)  :=  and  fi(s)  :=  fyN{s )  into  equation  (4.11),  the  Laplace 
transform  of  the  autocovariance  function  is  in  (4.28). 


rv  s  = 


MiMo 


GW  ELo 


Ap 

s+Aq 


_i_  r(  v1-1  ffn  A° vhn 

'  \  )  Z—/n=0  Ap  — v/^/n  V  s-\-v/')ri 


(/C  +  /h))s 

1  1 


s+Ao ' 


(4.28) 


(/L  +  ^o)-S2[l  —  G(N)  Yln= 0  0 


,  A0z^/7n  /  l _ 1 

Aq — v/r)n  \s-\-v/')n  s+Ap 


)] 


The  power  spectrum  density  of  rv(t)  can  be  calculated  by  the  following  relation: 

’S'v(^)  ^"v  (®)  |  s=— ju;  T  Tv  (s)  |  s=ju>-  (4.29) 

Assuming  that  the  mother  wavelet  -0  is  an  idea  bandpass  function,  the  Logscale 
diagram  Lj  of  a  single  connection  at  small  time  scales  can  be  approximated  as 
follows: 

Lj  =  log2E[d)k] 

~  log2{[  Sv{u)2i\\V{2*u)\\2dtj}}.  (4.30) 

J  —7 r 

Again,  the  Logscale  diagram  of  the  aggregate  traffic  is: 

Lj  «  Lj  +  log2(N(BR1)2).  (4.31) 

The  dash  and  dot  lines  at  Figure  4.1  compare  the  theoretical  approximations 
of  Logscale  diagram  with  Exponential  (Case  1)  and  Pareto  distribution  (Case  2) 
respectively.  It  shows  that  the  equation  4.31  provides  a  good  approximation  for 


59 


the  Logscale  diagram  at  small  time  scales.  With  this  formula,  we  are  able  to  un¬ 
derstand  the  relationship  between  network  parameters  and  the  traffic  statistical 
properties.  Figure  4.3  and  4.4  also  show  the  Logscale  diagrams  of  numerical  ap¬ 
proximation  and  synthetic  traffic  with  different  network  parameters.  Figure  4.3 
shows  the  effect  of  increasing  the  burst  size  by  two  times  and  four  times.  In 
Figure  4.4  both  ON(T2i)  and  OFF  (T2o)  time  periods  are  increased  in  order  to  keep 
the  same  mean  rate.  In  those  two  figures  both  synthetic  and  theoretical  results 
predict  the  statistical  properties  of  real  traffic  with  different  network  parameters. 

The  second  step  of  onr  approach  is  to  find  the  queuing  behaviors  according  to 
the  Logscale  diagram  of  aggregate  traffic.  Given  the  Logscale  diagram  (numerically 
estimated  from  the  trace  or  theoretically  computed  from  the  model),  we  develop 
an  efficient  algorithm  in  the  next  section  to  predict  the  queue  length  distribution 
and  the  corresponding  connection  performance  such  as  delay  and  jitter.  With  this 
technique,  one  can  employ  the  network  parameters  to  predict  the  queuing  behavior 
of  a  bottleneck  router.  It  builds  up  a  relationship  between  the  network  parameter 
and  the  connection  performance,  which  is  very  helpful  in  protocol  design.  Fur¬ 
thermore,  based  on  the  prediction  of  the  queuing  behavior,  one  may  adjust  control 
parameters  in  a  router  such  as  the  maximum  threshold  and  the  dropping  probabil¬ 
ity  of  the  RED  (Random  Early  Detection)  scheme.  We  will  discuss  these  problems 
in  the  following  chapters. 
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Figure  4.4:  Logscale  diagram  of  Real  Trace  and  Synthetic  Traffic 


4.3  Approximation  of  Steady  State  Queue  Length 


Distribution 

As  shown  in  the  Chapter  2,  the  wavelet  method  has  many  advantages  in  param¬ 
eter  estimation  and  traffic  analysis.  Given  the  Logscale  diagram  (numerically  or 
theoretically)  of  the  arrival  traffic,  we  develop  an  efficient  method  to  obtain  an 
upper  bound  of  the  overflow  probability  by  using  the  properties  of  wavelet. 


Proposition  4.3.1  Let  a3^  and  dj:k  be  the  approximation  coefficient  and  the  detail 
coefficient  of  the  Haar  wavelet,  we  have  the  following  relations  according  to  Remark 
2.1.7 


Oj+i,fc  +  dj. i-i,fe 

%2fc  “  — "vf 

®7  +  l,fc  ^7  +  1,  k 

a-2k+1  =  — • 


(4.32) 

(4.33) 


With  the  uncorrelated  assumption  between  a.J+\.k  and  dJ+\.k  for  each  j ,  we  have 


V ar[aj  = 


V  ar  [a  j+f\  +  V  ar[dj+i } 


(4.34) 


Note  that  the  plot  log2  Var[dj\  v.s  j  is  the  Logscale  diagram.  On  the  other  hand, 
let  the  workload  Aj  to  be  the  total  arrival  bytes  in  the  interval  [0,  2J'A).  From  the 
definition  of  Haar  wavelet,  we  have 


A,  =  a,Vl\ 


(4.35) 


Thus,  by  using  the  Logscale  diagram  and  the  variance  of  workload  at  the  finest 
scale  Var[Ao]  as  the  initial  values,  the  variance  of  workload  V  ar  [A  f\  can  be  com¬ 
puted  recursively  for  all  j. 
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Proposition  4.3.2  Given  the  Logscale  diagram  Lj  :=  log2Var[dj]  and  the  vari¬ 
ance  of  A0,  the  variance  of  Aj,  j  =  1,2, ...  is 

V ar[Aj]  =  2^Var[aj] 

Var[aj]  =  2Var[a,j_i\  —  V  ar[df]. 

Since  the  workload  Aj  is  contributed  by  many  independent  connections  which 
are  active  in  the  the  interval  [0,  2JA),  The  Central  Limit  Theory  suggests  that  the 
candidate  distribution  of  workload  should  be  Normal  distribution  with  parameters 
( EAj ,  V ar[Aj ]).  Figure  4.6  shows  that  the  Normal  distribution  works  well  at  large 
time  scales  in  fitting  the  empirical  distribution,  ffowever,  the  packet  arrival  pattern 
is  very  bursty  and  spare  at  small  time  scales.  There  are  few  connections  have 
packets  arriving  in  a  small  interval.  The  assumption  of  large  number  aggregation 
of  independent  connections  is  invalid.  On  the  other  hand,  the  Normal  distribution 
always  has  a  positive  probability  for  negative  values.  This  property  is  not  suitable 
for  modeling  a  positive  random  variable  such  as  traffic  workload.  We  are  looking 
for  a  distribution  of  positive  random  variable  which  can  best  fit  the  empirical 
distribution  of  real  traffic  at  all  time  scale.  We  applyied  the  Kolmogorov- Smirnov 
Goodness-of-Fit  test  [80]  [81]  [82]  to  most  well-known  distributions  of  positive 
random  variables  and  found  the  Lognormal  distribution  is  a  good  choice. 

Assuming  that  the  workload  Aj  has  the  Lognormal  distribution  for  all  j  with 
mean  Mj  and  variance  V},  the  probability  density  function  of  the  Lognormal  dis¬ 
tribution  is: 

fAj(x):= - ~y==  exp [—  (ln X  ^  ],  x  >  0  (4.36) 

xOj\'2n  20j 

Since  the  rth  moment  of  the  Lognormal  distribution  has  a  closed-form: 

r2o 2- 

EA r:  =  exp (r/j,  4 - —A).  (4.37) 

J  2 
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The  parameters  ji3  and  Oj  can  be  easily  obtained: 


a 


Mf 


cr 


/L  =  Hmj)  ~  -f- 


(4.38) 

(4.39) 


Figure  4.5  and  Figure  4.6  are  the  complement  CDF  of  the  real  workload  with 
scale  j  =  1,2,  ..,14  and  the  corresponding  Normal  and  Lognormal  distribution 
fitted  by  the  estimated  mean  and  variance.  It  is  obvious  that  the  prediction  of 
Normal  distribution  is  severely  underestimated  at  small  time  scales. 
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Figure  4.5:  The  CCDF  of  workload  Aj  j  =  1,2,...,  7  and  the  fitted  Normal  and 
Lognormal  distribution. 
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Figure  4.6:  The  CCDF  of  workload  Aj  j  =  8,9, ...,  14  and  the  fitted  Normal  and 
Lognormal  distribution. 
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Consider  a  simple  FIFO  queue  with  a  finite  buffer  size  B  (bytes)  and  capacity  C 
(bytes/ A).  Assuming  that  the  distribution  of  workload  Aj  is  known  for  all  j,  Riedi 
[78]  proposes  a  simple  upper  bound  for  the  overflow  probability  P[Q  >  B ],  where 
r.v.  Q  is  the  queue  length  in  the  steady  state.  Let  the  event  Ej  :=  {Aj  <  B+C 2J  A} 
be  assumed  to  be  independent  of  each  other.  According  to  the  lemma  proved  by 
Riedi: 

Lemma  4.3.3  [78]  Assume  that  the  event  Ej  =  {Sj  <  bj},  where  Sj  =  Rq  + 

...  +  Rj-i  for  1  <  j  <  n  and  where  Ro,..., Rn  are  independent,  otherwise  arbitrary 
random  variables.  Then,  for  1  <  j  <  n,  we  have 

PiEjlEj-u.^Eol^PiEj].  (4.40) 

With  Lemma  4.3.3,  we  have  the  upper  bound  of  the  overflow  probability: 

Proposition  4.3.4  Let  Ej  :=  {Aj  <  B  +  C2-7  A}.  An  upper  bound  of  the  overflow 
probability  of  a  FIFO  queue  is 

p[Q  >  B]  =  1-P[Q<B\^1-  P[nf=1Ej] 

K 

3= 1 

K  K 

<  i-n^i=i-n  P[Aj  <  B  +  C2j A],  (4.41) 

3=0  3=0 

where  K  is  the  maximum  octave  and  2aA  is  the  maximum  time  scale. 


Figure  4.7  shows  this  upper  bound  with  different  bandwidth  utilization  p  = 
0.1,  0.2, 0.9.  Figure  4.8  to  4.11  are  the  comparisons  of  this  upper  bound  with 
the  queue  length  distribution  of  the  real  network  traffic  and  the  synthetic  traffic. 
These  results  suggest  that  the  upper  bound  of  overflow  probability  may  provide  a 
good  approximation  of  the  queue  length  distribution.  In  the  next  section,  we  will 
use  this  approximation  to  predict  the  mean  queuing  delay  and  choose  the  RED 
parameter  properly  to  satisfy  the  given  mean  delay  requirement. 
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When  we  calculate  the  upper  bound  of  the  overflow  probability,  we  find  that 
only  one  workload  Aj  at  certain  time  scale  j  dominates  the  overflow  probability. 
Figure  4.12  shows  that  in  the  light  traffic  condition  (p  =  0.3),  the  traffic  perturba¬ 
tion  at  small  time  scales  mainly  causes  the  overflow  events.  Figure  4.13,  4.14  and 
4.15  show  that  when  the  traffic  load  is  increasing,  the  overflow  probability  of  the 
queue  will  be  mainly  dominated  by  traffic  behavior  at  larger  time  scales.  On  the 
other  hand,  with  a  fixed  utilization,  the  dominating  time  scale  is  related  to  the 
buffer  size.  The  smaller  buffer  size  has  a  smaller  dominating  time  scale.  It  implies 
that  the  short  range  dependence  in  the  traffic  dominates  the  queuing  behavior 
when  the  buffer  size  is  small.  Note  that  most  of  dominating  time  scales  are  less 
than  1  sec.  The  statistical  properties  of  the  traffic  at  small  time  scale  is  directly 
related  to  the  protocol  behavior.  It  suggests  that,  for  the  small  buffer  queue,  the 
protocol  behavior  plays  a  more  important  role  than  the  hie  size  distribution  does 
at  the  upper  level.  It  also  suggests  that  the  monofractal  or  single  level  ON/OFF 
model  is  not  quite  suitable  for  predicting  the  behavior  of  a  small  buffer  queue. 

The  selection  of  maximum  octave  K  also  determines  the  upper  bound.  Riedi 
[78]  suggested  that  the  maximum  time  scale  2aA  should  be  large  enough  to  con¬ 
tain  an  empty  event.  Figure  4.12  to  4.15  show  that  the  upper  bound  will  converge 
as  K  goes  to  infinite.  In  practice,  the  maximum  octave  K  can  be  chosen  large 
enough  so  that  the  probability  P[Ej]  converges  to  1. 
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Figure  4.12: 


The  Pr[Aj  <  B  +  2JC'A]  v.s.  octave  j,  buffer 
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size  B 


10/1, 20/1, 400/1  (bytes)  with  utilization  p= 0.3 
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Figure  4.13:  The  Pr[Aj  <  B  +  2J,CA]  v.s.  octave  j,  buffer  size  B 
10/1, 20/1 , 400/1  (bytes)  with  utilization  p=0.5 
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Figure  4.14:  The  Pr[Aj  <  B  +  2J,CA]  v.s.  octave  j,  buffer  size  B 
10K,  20/1, 400/1  (bytes)  with  utilization  p— 0.7 


Figure  4.15:  The  Pr[Aj  <  B  +  2J,C'A]  v.s.  octave  j,  buffer  size  B 
10K,  20K, ...,  400iF  (bytes)  with  utilization  p= 0.9 


4.4  Queuing  Delay  and  Jitter  Analysis 


With  the  prediction  of  steady  state  queue  length  distribution,  it  is  straightforward 
to  predict  the  average  queuing  delay  by  calculating  the  ratio  of  mean  queue  length 
and  service  rate.  We  consider  a  constant-bit  rate  traffic  as  our  probing  process. 
As  shown  in  Figure  4.16,  the  CBR  process  sends  out  a  small  packet  every  2"  A  sec 
persistently  and  shares  the  bandwidth  with  the  background  traffic.  Assume  that 
the  packet  size  of  the  CBR  process  is  set  to  be  small  enough  such  that  mean 
rate  of  CBR  traffic  is  negligible.  According  to  the  approximation  of  steady  state 
queue  length  distribution  at  ,  the  average  queuing  delay  D  of  CBR  packet  can  be 
predicted  by  the  following  proposition. 


Figure  4.16:  The  target  process  and  the  background  traffic 

Proposition  4.4.1  Assume  a  FIFO  queue  is  in  the  steady  state  with  (time  aver¬ 
age)  distribution  Pr[Q  >  b\.  The  average  queuing  delay  of  CBR  packets  is: 

D  =  ^Y,Pr^>h^  (4-42) 

b>0 

where  '}fb>0  ^>r'[Q  >  b\  is  equal  to  the  mean  (time  average)  queue  length  of  the 
FIFO  queue  with  infinite  buffer  size. 

Sketch  of  Proof.  Since  the  CBR  packets  arrive  at  the  queue  every  2”  A sec,  the 
average  queue  length  seen  by  the  CBR  packets  is  equal  to  the  time  average  of 
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queue  length.  The  average  packet  delay  at  a  FIFO  queue: 


D  =  S  +  W  (4.43) 

=  ^EPr[Q>6]’  (4-44) 

b>  0 

where  W  and  S  mean  waiting  time  and  mean  service  time  respectively.  Note  that 
the  CBR  packet  size  is  assumed  to  be  small  and  negligible,  we  assume  S  —  0. 

□ 

Figure  4.17  to  4.23  demonstrate  the  qnene  length  distribution  which  is  seen 
by  CBR  packets.  These  figures  also  show  that  the  analysis  approximation  indeed 
provide  an  accurate  prediction  for  delay  distribution  of  CBR  packets.  Figure  4.24 
shows  the  prediction  of  mean  packet  delay  with  different  utilization. 
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Figure  4.17:  Queue  length  distribution  seen  by  CBR  packet  p  =  0.191 
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p=0.293 


Figure  4.18:  Queue  length  distribution  seen  by  CBR  packet  p  =  0.293 
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Figure  4.19:  Queue  length  distribution  seen  by  CBR  packet  p  =  0.383 
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Figure  4.20:  Queue  length  distribution  seen  by  CBR  packet  p  =  0.468 
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Figure  4.21:  Queue  length  distribution  seen  by  CBR  packet  p  =  0.573 
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p=0.661 


Figure  4.22:  Queue  length  distribution  seen  by  CBR  packet  p  =  0.661 
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Figure  4.23:  Queue  length  distribution  seen  by  CBR  packet  p  =  0.771 
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Figure  4.24:  Mean  CBR  packet  delay 


For  real  time  applications  such  as  video/voice  conference/stream  and  voice  over 
IP,  the  delay  jitter  plays  an  more  important  role  in  the  connection  quality.  The 
jitter  is  mainly  caused  by  the  perturbation  of  background  traffic  in  the  bottleneck 
queue.  Fulton  and  Li  [83]  provided  analytical  approximations  for  the  first-order 
and  second-order  statistics  of  the  delay  jitter.  However,  like  other  the  jitter  analysis 
methods  [84]  [85]  [86],  their  works  are  based  on  the  Markovian  model  of  background 
traffic,  which  is  not  suitable  for  the  long  range  dependent  process  such  as  Internet 
traffic.  The  Markovian  approach  also  requires  a  lot  of  computational  efforts  and 
memory  space  for  conditional  probabilities  and  matrices  .  We  develop  an  efficient 
method  to  predict  the  variance  of  jitter  based  on  the  the  Logscale  diagram  of  the 
traffic. 

We  consider  the  CBR  traffic  in  the  previous  scenario  of  Figure  4.16.  The  CBR 
process  is  employed  here  to  estimate  the  delay  jitter  of  the  bottleneck  queue.  The 
Logscale  diagram  of  the  background  traffic  is  given  as  Lj.  The  delay  jitter  is 
defined  as  follows: 

Definition  4.4.2  Let  the  random  sequence  Ii  be  the  interdeparture  time  of  the  tar¬ 
get  process,  the  jitter  is  defined  as  the  difference  of  two  consecutive  interdeparture 
times: 


Jr 


Li+ 1 


l. 


(4.45) 


We  also  define  Anj  as  the  total  arrival  bytes  of  the  background  traffic  in  the 
ith  time  slots.  The  duration  of  each  time  slot  is  T  :=  2nA  sec. 

Proposition  4.4.3  Let  the  current  time  be  t —IT  andb=2n+1A(C  —  m).  Assuming 
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Figure  4.25:  The  arrival  and  departure  time  of  CBR  traffic 
that  the  current  queue  length  is  q(t )  >  b.  The  conditional  variance  of  jitter  is 

Var[J\q(t)  >  b\  =  -^V ar[(An,i+1  -  AHii)].  (4.46) 

Proof  Without  loss  of  generality,  let  the  CBR  packets  arrive  at  time  0,  T,  and  2 T, 
which  have  queuing  delay  do,  di  and  o?2,  respectively.  As  shown  in  Figure  4.25,  the 
total  arrival  bytes  of  the  background  traffic  in  the  ith  time  slots  (t  G  \iT,  (i  +  1)T)) 
is  Anj.  Since  the  current  length  g(0)  >6  is  quite  large,  it  is  reasonable  to  say  that 
the  output  link  is  always  busy  during  the  0th  and  1th  time  slots.  Moreover,  the 
buffer  size  is  infinite  so  that  there  is  no  packet  loss  event.  The  Lindley  equation: 

q(t)  =  maxo<s<t[A(f)  -  A(s)  -  C(t  -  s)],\/t  >  0  (4.47) 

can  be  simplified  as 

q((i  +  1  )T)  =  q(iT)  +  -  CT.  (4.48) 

Hence,  the  packet  delay  di  =  q{iT)/C  and  the  interdeparture  time  is 

h  —  T  +  di+ 1  —  di 
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(4.49) 


q((i  +  1)T)  -  q(iT) 
C 


C  ■ 

The  jitter  variance  under  this  condition  is 


Var[J\q(t)  >  b]  =  Var(Ii+1  -  h) 

V  or  [(j4nij_|_i  An!j)] 
C2  ' 


(4.50) 


□ 


According  to  the  definition  of  Haar  wavelet,  one  may  easily  obtain  the  value  of 
V ar[{An  i  —  An:0)]  from  the  Logscale  diagram  Lj 


2L,+1  =  E\d%lik]  = 


2^ 


(4.51) 


for  every  j  and  k. 

Hence,  the  conditional  jitter  variance  of  the  CBR  process  is 

2n+l+L„+i 

Var[J\q(t)  >  b]  =  - — - .  (4.52) 

On  the  other  hand,  if  the  current  queue  length  is  small  ( q(t )  <  b),  we  assume 
that  there  is  at  least  one  idle  server  event  happening  in  the  next  two  time  slots. 
The  simple  relationship  of  equation  (4.48)  and  (4.49)  does  not  hold.  Since  there 
is  at  least  one  idle  event  (queue  empty  event)  in  this  period,  the  sequence  of 
interdeparture  times  A  and  the  sequence  of  packet  delays  di  can  be  treated  as 
uncorrelated  random  sequences  respectively.  We  have  the  following  upper  bound: 


V ar[J\q{t)  <  b]  =  V ar[Ii+ 1  —  A]  «  2 V ar[Ii\ 


=  2V ar[di+i  —  di]  ~  4 V ar[di\ 

<  A  Var[(Aj-C2J A) +] . 


(4.53) 
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Proposition  4.4.4  Let  A  be  the  Lognormal  random  variable  with  parameter  (/x,  a) 


and  d  >  0  be  any  real  number,  we  have 


E{(A-d) +] 


g^+cr2/ 2 

- erfc( 

2  v 

g2|i-)-2(T2 
— 2 — er/c( 
de^+(j2 12  er  f  c\ 


In  d  —  p  —  a2 .  -. 

Ind  —  p  —  2 a2  ^ 

7!^  } 

t  In  d  —  p  —  <t2  ^ 

(  7^  } 


d2F(d ), 


(4.54) 


(4.55) 


w/iere  F(d )  :=  Pr[A  >  cl] . 

The  probability  of  Pr[q{t )  >  6]  is  based  on  the  prediction  of  the  steady  state 
queue  length  distribution.  We  apply  the  previous  results  of  queue  length  distribu¬ 
tion  in  section  4.3  to  estimate  the  probability  Pr[Q  <  b ]. 


Proposition  4.4.5  Let  the  r.v.  Q  be  the  queue  length  in  steady  state.  From  (4-52) 
(4-53)  and  (4-4),  there  is  an  upper  bound  of  the  jitter  variance  of  the  CBR  traffic 
at  the  bottleneck  router: 


Var(J)  <  Var(J\Q  >  b)Pr[Q  >  b\ 

+Var(J\Q  <  b))Pr[Q  <  b].  (4.56) 

Given  the  Logscale  diagram  of  the  traffic,  we  are  able  to  predict  the  jitter  vari¬ 
ance  immediately  via  (4.56).  We  validate  this  efficient  approach  by  ns2  simulation 
in  the  next  section. 
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4.5  Simulation  results 


The  network  topology  in  this  experiment  is  the  simple  dumbbell  network  with  a 
single  bottleneck  link.  One  side  of  the  bottleneck  link  consists  of  800  web  clients. 
The  clients  send  request  to  web  serve,  wait  response,  receive  data  and  close  the 
session.  The  interval  between  the  end  of  session  and  next  web  request  is  called 
user  think  time.  In  this  simulation  the  user  think  time  of  clients  has  an  Exponen¬ 
tial  distribution  with  mean  50sec,  which  is  related  to  the  human  behavior  in  the 
Internet  [9]. 

The  other  side  has  800  web  servers.  The  server  is  running  HTTP  1.1  protocol 
and  has  a  Pareto  file  size  distribution  with  parameters  (K=2.3Kbytes,  a=1.3). 
The  propagation  delay  of  each  server  link  is  uniformly  distributed  in  the  interval 
(16ms,  240ms)  and  the  mean  round-trip  time  is  about  128ms.  The  aggregated 
traffic  requested  by  web  client  and  generated  by  web  servers  has  a  mean  rate 
about  1.2 Mbps.  It  is  treated  as  the  background  traffic  in  the  previous  analysis. 
The  unique  CBR  source  at  the  server  side  has  a  permanent  connection  cross  the 
bottleneck  link.  The  CBR  packets  are  sent  periodically  every  T  sec.  and  received 
at  the  client  end.  Since  the  link  propagation  delay  of  CBR  connection  is  known, 
the  queuing  delay  and  jitter  of  CBR  packet  is  estimated  straightforwardly.  We 
consider  both  FIFO  and  RED  as  the  buffer  management  policy  at  the  bottleneck 
link  respectively. 

Case  1:  Jitter  at  a  FIFO  Queue 

Figure  4.26  compares  the  predicted  standard  deviation  of  jitter  with  the  simula¬ 
tion  results.  The  target  CBR  process  has  fixed  interarrival  time  T  =  2 nms  and 
n  =  3,4,  ..,8.  As  a  probing  connection,  the  CBR  has  a  small  packet  size  so  that 
the  average  mean  rate  of  CBR  traffic  is  negligible.  Since  the  mean  rate  of  the 


94 


background  traffic  is  about  1.2 Mbps,  the  link  utilization  is  about  0.4,  0.6  and  0.8 
with  the  corresponding  bandwidth  C  =  3.0,  2.0  and  1.5 Mbps  respectively. 

Case  2:  Jitter  at  a  RED  Queue 

We  replace  the  FIFO  queue  by  an  adaptive  RED  queue  at  the  bottleneck  router. 
The  adaptive  RED  queue  [87]  [88]  will  keep  the  average  queue  length  located  in  a 
desired  region  by  randomly  dropping  the  TCP  packets.  Since  the  queue  length  is 
in  the  desired  region,  the  link  has  a  100%  utilization  and  no  idle  event  happened. 
Hence,  the  jitter  variance  is  bounded  by  equation  (4.52).  Figure  4.27  shows  that  the 
prediction  method  also  provides  tight  bound  for  the  jitter  variance  with  different 
queuing  policy  such  as  RED  is  employed. 

Since  the  interarrival  time  of  most  time-sensitive  applications  is  located  in  the 
range  of  (8 ms,  256 ms).  Both  simulation  results  show  that  our  prediction  method 
indeed  provides  a  tight  upper  bound  of  jitter  within  this  range. 
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Figure  4.26:  The  standard  deviation  of  delay  jitter  std(J)  v.s.  n  with  a  FIFO 
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Figure  4.27:  The  standard  deviation  of  delay  jitter  std(J)  v.s.  n  with  a  RED  queue 


4.6  Summary 


In  this  chapter  we  first  analyze  the  multilevel  ON / OFF  model  and  provide  a  the¬ 
oretical  approximation  of  Logscale  diagram.  According  to  the  property  of  wavelet 
analysis,  the  second  order  statistics  of  traffic  can  be  immediately  extracted  from 
the  corresponding  Logscale  diagram.  Given  the  mean  and  variance  of  traffic  work¬ 
load  at  every  time  scale,  we  are  looking  for  a  mathematical  distribution  which 
can  best  fit  the  empirical  distribution  of  real  workload.  After  we  employed  the 
Kolmogorov- Smirnov  goodness-of-fit  test,  the  Lognormal  distribution  is  chosen 
as  the  best  candidate.  The  steady  state  queue  length  distribution  is  approximated 
according  to  the  workload  distribution  at  every  time  scale. 

The  queuing  delay  and  jitter  are  also  investigated  on  the  basis  of  the  approxi¬ 
mation  of  queue  length  distribution.  The  mean  delay  is  directly  predicted  by  the 
mean  queue  length  and  the  link  bandwidth.  An  upper  bound  of  jitter  variance 
is  developed  according  to  the  property  of  wavelet  analysis  and  a  simple  condition 
probability.  All  theoretical  results  such  as  queue  length  distribution,  mean  delay 
and  jitter  variance  are  validated  by  simulation. 

The  proposed  traffic  model  and  the  analysis  in  statistical  property  and  queu¬ 
ing  behavior  provide  a  simple  but  efficient  platform  to  understand  the  network 
engineering.  It  explains  how  the  network  parameters  such  as  the  active  time, 
round-trip  time,  session  life  time  and  user  think  time  affect  the  traffic  statistic 
at  different  time  scales.  Furthermore,  it  also  helps  us  understand  the  impacts 
of  network  parameters  on  connection  performance  such  as  packet  loss,  delay  and 
jitter. 

In  the  next  chapter  we  focus  on  the  active  queue  management  (AQM)  policy 
at  the  bottleneck  router.  A  parallel  structure  and  a  new  idea  of  flow  control  are 


proposed  for  providing  a  better  quality  of  service. 
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Chapter  5 


Parallel  Queue  Structure  for 
Active  Queue  Management 

5.1  Overview 

For  small  queuing  delay,  the  buffer  size  in  a  ronter  is  not  large.  However,  a  router 
with  small  buffer  size  often  has  a  high  packet  dropping  rate  since  the  Internet 
traffic  is  bursty.  When  packets  are  lost,  the  TCP  protocol  dramatically  reduces 
the  flow  rate  during  the  congestion  avoidance  phase  [89].  Therefore,  after  a  buffer 
overflow  event  in  a  drop-tail  queue,  all  connections  sense  packet  loss  and  slow  down 
the  transfer  rate  simultaneously.  In  order  to  prevent  this  global  synchronization 
phenomenon  and  increase  link  utilization,  many  active  queue  management  schemes 
such  as  RED  (Random  Early  Detection)  [41]  have  been  proposed  and  received 
increasing  attention. 

The  basic  idea  of  RED  is  to  randomly  drop  packets  to  prevent  buffer  over¬ 
flow  and  the  global  synchronization  problem.  The  dropping  probability  is  a  non- 
decreasing  function  of  the  queue  length.  A  TCP  connection  with  a  higher  flow 
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Figure  5.1:  The  dropping  probability  function  of  original  RED. 

rate  has  a  better  chance  to  get  packets  dropped  and  reduce  its  rate  more  rapidly. 
By  dropping  packets  actively,  RED  keeps  the  queue  length  within  a  desired  region. 
However,  some  simulation  and  analysis  results  [90]  [39]  [91]  [92]  have  demonstrated 
that  the  performance  of  RED  is  very  sensitive  to  parameter  settings.  Based  on 
the  original  idea  of  RED,  there  have  been  some  modifications  such  as  Stabilized 
RED  (SRED)  [93],  Flow  RED  (FRED)  [94],  Weighted  RED  [95],  Random  Early 
Marking  (REM)  [38],  BLUE  [40]  and  Adaptive  RED  [88]  [87].  The  Adaptive  RED 
scheme  dynamically  updates  the  maximum  dropping  probability  according  to  the 
exponentially  weighted  moving  average  (EWMA)  of  the  queue  length,  and  makes 
itself  more  robust  with  respect  to  the  congestion  level. 

The  Adaptive  RED  policy  provides  good  rate  control  for  TCP  connections 
operating  in  the  congestion  avoidance  phase  [75]  [87].  However,  a  great  portion  of 
Internet  traffic  is  web  and  LIDP  traffic.  Since  most  web  connections  involve  transfer 
of  several  small  files,  these  connections  have  a  short  life  and  are  mostly  operated 
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in  the  TCP  slow  start  phase  with  a  small  congestion  window.  Dropping  web 
packets  in  this  phase  is  not  an  effective  way  to  control  the  traffic  rate  and  alleviate 
the  congestion  at  the  bottleneck  router.  Furthermore,  from  the  viewpoint  of  a 
web  user,  one  or  several  packet  losses  in  the  slow  start  phase  would  lead  to  extra 
delay  for  retransmission  or  even  TCP  timeout.  It  would  also  force  TCP  to  enter 
the  congestion  avoidance  phase  prematurely  with  a  small  congestion  window  and 
result  in  a  low  throughput.  The  delay  and  low  throughput  would  severely  degrade 
the  performance  of  delivering  short  messages  such  as  web  pages,  and  web  browsers 
experience  long  waiting  times  even  with  a  high  speed  network.  On  the  other  hand, 
the  Adaptive  RED  fails  to  maintain  the  queue  length  within  the  desired  region 
due  to  the  bursty  nature  of  web  traffic. 

To  address  these  problems,  we  propose  a  parallel  virtual  queue  structure  for 
active  queue  management  in  this  paper.  In  this  structure,  real  time  (web,  UDP) 
and  non-real  time  (FTP)  traffic  are  separated  into  two  different  virtual  queues 
which  share  the  same  physical  buffer  memory.  The  drop-tail  policy  is  applied  at 
the  first  virtual  queue  to  serve  real  time  applications.  In  order  to  have  a  small 
mean  delay,  the  service  rate  of  this  drop-tail  queue  is  dynamically  determined 
by  its  virtual  queue  length.  The  remaining  non-real  time  traffic  is  directed  to 
an  Adaptive  RED  virtual  queue.  Simulation  shows  that  this  parallel  virtual  queue 
structure  not  only  has  the  advantages  of  Adaptive  RED  such  as  high  link  utilization 
and  small  delay,  but  also  greatly  reduces  the  total  packet  loss  rate  at  the  router. 
Despite  that  the  bandwidth  is  shared  with  the  bursty  drop-tail  virtual  queue,  the 
Adaptive  RED  queue  has  a  small  length  variation. 

The  original  Adaptive  RED  dynamically  changes  the  maximum  dropping  prob¬ 
ability  Pmax  to  keep  the  queue  length  within  the  thresholds.  However,  for  some 
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non-real  time  applications,  high  goodput  (low  packet  dropping  rate)  is  more  impor¬ 
tant  than  short  packet  delay.  Hence  we  explore  a  modified  Adaptive  RED  policy 
for  the  non-real  time  applications  at  the  second  virtual  queue,  where  the  queue 
length  thresholds  are  dynamically  adjusted  to  maintain  the  dropping  probability 
of  Adaptive  RED  algorithm  in  a  desired  range. 

The  remainder  of  the  dissertation  is  organized  as  follows.  In  Section  5.2,  we 
demonstrate  the  vulnerability  of  the  Adaptive  RED  in  the  presence  of  web  and 
UDP  traffic.  The  parallel  virtual  queue  structure  is  described  in  Section  5.3.  Com¬ 
parison  of  this  approach  with  the  original  Adaptive  RED  scheme  is  given  through 
simulation  in  Section  5.4.  In  Section  5.5,  we  present  the  modified  Adaptive  RED 
policy  with  dynamic  queue  length  thresholds.  Performance  and  stability  analysis 
is  provided  for  this  parallel  structure  with  the  drop-tail  and  the  modified  Adaptive 
RED  queue  in  Chapter  6.  Finally,  we  conclude  our  work  in  Chapter  7. 

5.2  Vulnerability  of  Adaptive  RED  to  Web-mice 

In  this  section  we  consider  a  scenario  containing  short-life  TCP  (WEB),  UDP 
(CBR)  and  long-life  TCP  (FTP)  traffic.  The  purpose  is  to  demonstrate  that  the 
performance  of  the  Adaptive  RED  scheme  is  severely  degraded  by  the  short-life 
web  traffic.  The  network  in  our  ns2  experiment  is  the  same  scenario  of  delay 
and  jitter  analysis  in  Chapter  4  except  the  FTP  traffic.  It  has  a  simple  dumb¬ 
bell  topology  with  the  bottleneck  link  bandwidth  C=3.0Mbps.  One  side  of  the 
bottleneck  consists  of  800  web  clients.  Each  client  sends  a  web  request  and  has  a 
think  time  of  Exponential  distribution  with  mean  50s  after  the  end  of  each  session. 
The  other  side  contains  800  web  servers,  running  HTTP  1.1  protocol  and  having  a 
Pareto  [16]  file  size  distribution  with  parameters  ( Kp—2.3Kbytes ,  a— 1.3)  (mean 
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10 Kbytes).  The  round-trip  propagation  delay  of  HTTP  connections  is  uniformly 
distributed  in  (16,  240)ms.  Note  that  the  mean  rate  of  the  aggregate  web  traffic 
is  around  1.2 Mbps.  There  is  one  CBR  traffic  source  which  periodically  gener¬ 
ates  a  1  Kbytes  UDP  packet  every  50ms.  Besides  these  short  web  connections 
and  UDP  traffic,  there  are  10  persistent  FTP  connections  sharing  the  bottleneck 
link  with  round-trip  propagation  delay  of  64ms.  Figure  5.2  shows  that  the  Adap¬ 
tive  RED  works  well  with  those  FTP  connections  before  the  web  traffic  comes  in. 
However,  after  the  CBR  source  and  web  servers  begin  to  share  the  bandwidth  at 
time  f=100s,  the  queue  length  of  Adaptive  RED  deviates  dramatically  from  the 
desired  region.  Since  the  Adaptive  RED  scheme  relies  on  average  queue  length 
to  determine  the  dropping  probability  and  control  the  TCP  flow  rate,  the  extra 
queue  length  perturbation  contributed  by  the  bursty  web  traffic  makes  the  Adap¬ 
tive  RED  increase/decrease  its  dropping  probability  rapidly.  This  over-reaction 
causes  a  great  queue  length  variation  and  poor  performance  in  packet  delay  and 
loss. 

Since  most  web  pages  contain  one  or  several  very  small  files,  these  TCP  con¬ 
nections  are  mostly  operated  in  their  slow  start  phase  during  the  session  life.  Ac¬ 
cording  to  the  TCP  protocol,  the  congestion  control  window  is  just  beginning  to 
increase  its  size  from  the  initial  value  and  the  flow  rate  is  low.  Dropping  packets  in 
the  slow  start  phase  cannot  efficiently  alleviate  the  congestion  level  at  the  bottle¬ 
neck  router.  In  other  words,  any  random  dropping/marking  policy  such  as  RED 
is  unable  to  effectively  control  the  congestion  level  without  considering  short-life 
TCP  (and  UDP)  traffic.  Furthermore,  losing  one  or  two  packets  in  the  slow  start 
phase  not  only  causes  a  very  low  throughput  and  extra  delay,  but  also  leads  to  a 
high  probability  of  connection  timeout.  This  is  further  illustrated  below. 
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Figure  5.2:  Queue  length  of  the  Adaptive  RED:  10  FTP  starting  at  t=0  and  800 
WEBs  and  1  CBR  coming  in  at  t=100s. 
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From  the  viewpoint  of  a  web  browser,  a  short-life  TCP  session  may  only 
need  several  round-trip  times  (RTT)  to  finish  the  whole  transmission.  When  the 
sender  senses  a  packet  loss,  the  slow  start  threshold  ssthresh  will  be  reduced  to 
min(cwnd,  rev -window)  /  2  [89]  and  the  new  congestion  window  size  cwnd  is  also 
decreased  depending  on  different  TCP  versions  (For  TCP  Reno,  the  new  cwnd  = 
ssthresh  and  TCP  enters  the  fast  recovery  phase.  For  TCP  Tahoe,  cwnd,  =  MSS 
(maximum  segment  size)  and  TCP  begins  a  new  slow  start  phase).  Since  original 
cwnd,  is  just  beginning  to  increase  its  size  from  its  initial  value  MSS  in  the  first 
slow  start  phase,  one  packet  loss  during  the  initial  several  round-trip  times  leads 
TCP  to  enter  the  congestion  avoidance  phase  with  a  very  small  ssthresh  and  cwnd. 
In  the  congestion  avoidance  phase,  TCP  slowly  increases  cwnd,  (the  increment  is 
about  one  A4SS  per  round-trip  time)  from  the  current  ssthresh.  Therefore,  losing 
one  packet  in  the  slow  start  phase  (as  shown  in  Figure  5.3)  takes  TCP  a  long  time 
to  complete  a  short  message.  In  addition,  since  the  web  traffic  is  short  but  bursty, 
these  web  connections  usually  experience  a  higher  packet  loss  rate  (see  the  web 
packet  loss  rates  of  the  Adaptive  RED  and  the  drop-tail  policies  in  Table  5.3). 

The  default  initial  value  of  ssthresh  is  64 KB  and  the  packet  size  is  1  KB  in 
this  paper.  Assuming  a  typical  packet  dropping  probability  Pd=0.04  when  using 
the  Adaptive  RED,  the  probability  of  losing  one  or  more  packets  in  the  slow  start 
phase  is  equal  to  1  —  (1  —  P^)64  =  0.9267  (assuming  that  packets  are  dropped  inde¬ 
pendently).  Therefore,  most  TCP  connections  have  at  least  one  packet  dropped  in 
their  first  slow  start  phase.  For  example,  assuming  that  the  15th  packet  is  dropped 
by  the  Adaptive  RED,  ssthresh  decreases  from  64KB  to  4/1 P  and  the  new  con¬ 
gestion  window  cwnd,  is  decreased  from  8 KB  to  1  KB  (Tahoe).  The  situation  gets 
worse  if  one  packet  is  dropped  earlier  (in  the  first  3  round-trip  times).  The  con- 
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Figure  5.3:  Congestion  window  size  of  TCP  Reno:  One  packet  loss  in  the  Slow 
Start  phase  (left)  and  One  packet  loss  in  the  Congestion  Avoidance  phace  (right). 

gestion  window  at  this  moment  is  so  small  that  the  sender  may  not  have  enough 
data  packets  to  trigger  the  receiver  to  generate  three  duplicate  acknowledgements. 
If  packets  cannot  be  recovered  by  this  fast  recovery  scheme,  TCP  has  to  depend 
on  the  protocol  timer  for  error  recovery.  The  default  value  of  the  protocol  timer 
is  usually  large  and  the  delivery  delay  could  be  increased  dramatically  by  timeout 
events.  Moreover,  the  probability  of  losing  two  or  more  packets  of  the  same  con¬ 
gestion  window  in  the  slow  start  phase  also  cannot  be  ignored.  These  events  lead 
to  a  high  probability  of  TCP  timeout  and  connection  reset. 

For  illustration  we  conduct  simulation  of  transferring  a  small  web  hie  in  a  stand 
alone  and  one  hop  environment.  There  is  no  other  traffic  sharing  the  bandwidth 
and  packets  are  dropped  intentionally.  Figure  5.4  shows  the  mean  delivery  delay 
v.s.  the  dropping  probability  for  hie  sizes  30K B-210K B ,  and  Table  5.1  lists  the 
mean  and  standard  deviation  of  the  delay.  For  example,  TCP  takes  about  4.81s 
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to  complete  transmission  of  a  90 KB  file  if  Pd  =  0.04;  in  comparison,  in  the  loss 
free  case,  the  hie  can  be  delivered  in  1.18s.  Since  most  web  pages  have  sizes  in 
the  above  range,  a  web  browser  will  experience  a  long  response  time  when  the 
dropping  probability  of  the  Adaptive  RED  is  high. 


Table  5.1:  Delivery  delay  of  small  hie:  mean  and  standard  deviation 


Pd 

0.00 

0.02 

0.04 

0.08 

30KB 

0.88(.0006) 

1.60(1.88) 

2.74(4.27) 

5.88(7.79) 

90KB 

1.18(.0008) 

2.79(2.39) 

4.81(3.91) 

9.24(6.30) 

150  KB 

1.34  (0.0008) 

3.51(1.90) 

6.51(4.60) 

13.38(8.87) 
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Figure  5.4:  Mean  delivery  delay  of  small  file  v.s.  dropping  probability  Pci  with  file 
sizes  30,60,  ...,210 Kbytes,  bandwidth  3 Mbps  and  round-trip  time  128ms. 


109 


5.3  A  Parallel  Virtual  Queues  Structure 


To  solve  the  problem  discussed  in  Section  II,  we  propose  a  parallel  virtual  queue 
structure  in  the  router.  The  first  virtual  queue  deals  with  the  short-life  real¬ 
time  traffic  (web,  UDP).  Since  dropping  these  packets  cannot  effectively  alleviate 
the  congestion  level,  but  severely  increases  delivery  delay,  it  would  be  good  to 
keep  them  in  the  queue  unless  the  total  buffer  (shared  with  the  other  queue)  has 
overflowed.  Hence,  the  queuing  policy  of  the  first  virtual  queue  is  chosen  to  be 
drop-tail  to  minimize  the  packet  loss  rate.  In  order  to  have  a  short  delivery  delay  for 
web  browsers  and  UDP  connections,  the  service  rate  C\  (t)  is  changed  dynamically 
according  to  its  virtual  queue  length  q\{t). 

The  second  virtual  queue  serves  long-life  TCP  connections  such  as  FTP  with 
large  hie  sizes,  where  the  Adaptive  RED  is  used.  Although  the  available  band¬ 
width  of  this  queue  is  determined  by  C2(t)=C-Ci(t),  the  Adaptive  RED  scheme  is 
expected  (and  will  be  verified  by  simulation  in  Section  5.4  to  keep  its  virtual  queue 
length  q2  (t)  in  a  desired  region  for  the  following  reason.  When  there  is  a  heavy 
workload  at  the  drop-tail  queue,  C-zit)  decreases  quickly.  FTP  receivers  experience 
slower  packet  arrival  rates  and  send  acknowledgement  packets  (ACK)  back  more 
slowly.  Without  increasing  the  dropping  probability  at  the  Adaptive  RED  queue, 
the  slower  ACK  arrival  rates  from  the  receivers  make  FTP  senders  reduce  flow 
rates  automatically  without  shrinking  their  congestion  window  sizes.  On  the  other 
hand,  when  the  congestion  level  is  alleviated,  the  Adaptive  RED  queue  receives 
more  bandwidth.  Since  the  congestion  window  sizes 

are  still  large  in  the  FTP  servers,  the  throughputs  of  FTP  is  quickly  recovered 
by  faster  arrival  rates  of  ACK  packets  from  the  receivers. 

With  this  parallel  virtual  queue  structure  (which  will  be  called  RED+Tail  pol- 
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icy  in  this  paper),  we  can  keep  the  benefits  of  Adaptive  RED  such  as  high  (100%) 
link  utilization.  Furthermore,  the  packet  loss  rate  of  the  short-life  TCP  and  UDP 
connections  is  greatly  reduced  by  the  drop-tail  policy  and  a  shared  buffer.  The 
packet  loss  rate  of  long-life  TCP  traffic  is  also  reduced  due  to  the  suppressed 
bandwidth,  larger  thresholds  (longer  RTT)  and  more  stable  average  virtual  queue 
length  for  the  Adaptive  RED  queue. 

We  now  discuss  how  to  implement  the  RED+Tail  policy.  The  first  problem  is 
how  to  split  the  long-life  traffic  from  other  short-life  web  traffic  at  the  router.  To 
this  end,  the  router  has  to  know  the  age  or  elapsed  time  of  each  TCP  connection. 
Unfortunately,  this  information  is  hidden  in  the  TCP  header  which  is  not  available 
to  the  IP  router.  However,  one  may  roughly  estimate  the  elapsed  time  by  using 
the  following  approach: 

•  When  a  packet  arrives  with  a  new  source-destination  pair  which  has  not  been 
seen  by  the  router  in  the  past  T  sec,  we  treat  it  as  a  new  TCP  connection 
and  identify  this  connection  as  a  short-life  connection; 

•  Send  the  new  connection  packets  to  the  drop-tail  queue; 

•  Set  a  counter  for  the  number  of  packets  of  this  connection; 

•  If  the  cumulative  packets  number  is  greater  than  a  threshold  N ,  we  assume 
that  the  file  size  is  large  enough  and  this  TCP  connection  has  already  left 
its  slow  start  phase.  We  redirect  the  subsequent  packets  of  this  connection 
to  the  Adaptive  RED  queue; 

•  Remove  the  counter  if  there  is  no  packet  arrival  in  the  last  T  sec. 

Preliminary  simulation  results  show  that  this  approach  successfully  prevents  small 
web  traffic  from  entering  the  RED  queue.  The  probability  of  false  alarm  is  less 
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than  0.02  in  our  scenario.  Since  the  web  traffic  has  small  hie  sizes  and  short  session 


times,  there  is  no  harm  if  the  short-life  connection  packets  are  misdirected  to  the 
RED  queue  after  time  T. 


Figure  5.5:  The  parallel  virtual  queue  structure  for  active  queue  management. 

Figure  5.5  shows  the  RED+Tail  parallel  queue  structure  in  the  router.  Recall 
that  C\(t)  and  C2(t)  denote  the  service  rates  of  the  drop-tail  queue  and  the  Adap¬ 
tive  RED  queue  at  time  t  respectively.  In  order  to  allocate  bandwidth  dynamically 
to  both  queues  and  assign  a  desired  region  of  queue  length  for  the  Adaptive  RED 
queue,  we  define  the  maximum  threshold  maxthi  and  minimum  threshold  minthi 
for  i  —  1,2.  The  service  rates  C\(t)  and  C'2(t)  are  given  by  the  following  algorithm: 

•  if  qi  —  0,  then  Ci(t)  :=  0. 

•  if  0  <  qi  <  minthi ,  then  Ci(t):—Cimin. 

•  if  minthi  <  qi,  then  Ci(t)\=min(C m^thl ,  Cimax). 

•  C2(t)  :=  C  —  Ci(t), 

where  C  is  the  link  bandwidth.  The  variable  q\  denotes  the  queue  length  of  the 
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drop-tail  queue.  The  constant  C\max  preserves  the  minimum  available  bandwidth 
C  —  Cimax  for  the  RED  queue  to  prevent  FTP  connections  from  timeout. 


5.4  Simulation  and  Comparison 

In  this  section,  we  compare  the  RED+Tail  scheme  with  the  Adaptive  RED  on  typ¬ 
ical  TCP  performance  metrics.  For  the  Adaptive  RED,  we  use  the  parameter  set¬ 
tings  suggested  by  Floyd  et  al  [87]  (a  and  (3  of  the  AIMD  algorithm).  Both  schemes 
were  implemented  in  the  ns2  simulator.  The  network  topology  and  scenario  are  as 
described  in  Section  II.  Table  5.2  lists  the  parameters  for  the  RED+Tail  policy  and 
the  Adaptive  RED  policy.  Note  that  the  virtual  queues  of  the  RED+Tail  scheme 
share  the  total  physical  buffer  size,  i.e.,  the  packets  in  the  drop-tail  virtual  queue 
will  not  be  dropped  unless  the  physical  memory  is  full.  The  Adaptive  RED  is 
set  in  a  “ gentle ”  mode  meaning  that  the  dropping  probability  between  {rnaxthz, 
2maxth2)  is  linear  in  (Pmax,  !)• 


Table  5.2:  Experiment  Settings 


Virtual  Qu.  i 

Buffer  Size 

minthi 

maxthi 

a 

p 

i  =  1 

160KB 

2KB 

30KB 

- 

- 

i  =  2 

shared 

20KB 

80KB 

0.01 

0.9 

Adapt.  RED 

160KB 

20KB 

80KB 

0.01 

0.9 

The  performance  for  a  connection  is  evaluated  by  the  packet  loss  rate,  delay  and 
throughput.  However,  we  are  more  concerned  about  packet  loss  rate  and  delay 
for  web  (short-TCP)  and  CBR  (UDP)  connections,  and  more  concerned  about 
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Tabic  5.3:  Performance  Metrics 


Policy 

Loss  % 

Delay  Sec. 

Rate  KB/s 

RED+TaihFTP 

2.747 

0.184 

209.465 

RED+Tail:  WEB 

1.278 

0.114 

144.455 

RED+TaikCBR 

0.300 

0.109 

19.867 

AdaptRED:FTP 

4.149 

0.143 

217.531 

Adapt  R  ED :  WEB 

4.514 

0.143 

137.124 

AdaptRED:CBR 

3.950 

0.141 

19.140 

Drop  Tail:  FTP 

1.916 

0.349 

215.243 

Drop  Tail:  WEB 

4.234 

0.340 

138.983 

Drop  Tail:  CBR 

1.550 

0.342 

19.601 

throughput  for  FTP  (long-TCP).  We  replaced  the  Adaptive  RED  with  RED+Tail 
scheme  at  the  router  and  repeated  the  experiment  of  Section  II.  For  comparison, 
an  experiment  with  the  drop-tail  policy  was  also  conducted.  The  random  seed 
of  the  simulator  was  fixed  so  that  the  processes  of  web  requests  and  hie  sizes 
had  the  same  sample  paths  in  all  experiments.  Table  5.3  lists  the  performance 
metrics  under  RED+Tail,  the  Adaptive  RED  and  the  traditional  drop-tail  scheme 
respectively. 

Figure  5.6  shows  the  queue  lengths  of  the  RED+Tail  scheme,  which  demon¬ 
strates  that  the  virtual  queue  length  q2  is  quite  stable  and  stays  in  the  desired 
region  even  after  the  web  and  CBR  traffic  begins  to  share  the  bandwidth  at  time 
f=100s.  The  actual  dropping  probability  for  the  FTP  traffic  is  reduced  from  4.15% 
to  2.75%  by  a  longer  queuing  delay  (184ms,  see  Table  III).  This  scheme  prevents 
the  over-reaction  behavior  of  RED  in  the  original  Adaptive  RED  case  and  keeps 


114 


the  mean  queue  length  q2  in  a  desired  region  (Compare  to  Figure  5.2). 

Figure  5.7  shows  the  packet  loss  rates  of  FTP,  web  and  CBR  connections  under 
the  Adaptive  RED  and  RED+Tail  schemes.  We  see  that  RED+Tail  provides  great 
improvement  in  packet  loss  for  web  and  CBR  connections.  The  web  packet  loss 
rate  is  reduced  from  4.51%  to  1.28%  and  CBR  packet  loss  rate  is  reduced  from 
3.95%  to  0.30%. 

Figure  5.8  compares  the  packet  delays.  The  mean  queuing  delay  of  web  and 
CBR  packets  in  the  RED+Tail  scheme  is  shortened  at  the  cost  of  the  FTP  pack¬ 
ets  delay.  The  web  and  CBR  packet  delay  depends  on  how  much  bandwidth  is 
allocated  to  the  drop-tail  queue.  One  can  satisfy  a  mean  delay  requirement  for 
the  web  and  CBR  connections  by  properly  adjusting  the  parameter  maxthi.  For 
example,  the  maxthi  of  the  RED+Tail  scheme  is  set  to  be  30Kbytes  so  that  the 
estimate  of  mean  delay  at  the  drop-tail  queue  is  about  80ms.  However,  the  service 
rate  C\  reaches  its  maximum  C imax  when  qi  >  maxthi.  The  actual  mean  delay 
should  be  larger  than  expected.  For  our  simulation  the  mean  delay  of  web  and 
CBR  traffic  is  around  110ms  (refer  to  analysis  in  Section  6.1). 

Figures  5.9  and  5.10  show  the  throughputs  of  FTP,  web  and  CBR  traffic.  Both 
schemes  achieve  100%  utilization  of  the  link  bandwidth.  Due  to  the  bandwidth 
allocation  scheme  in  the  RED+Tail  scheme,  FTP  has  a  slightly  smaller  throughput. 
However,  the  saved  bandwidth  allows  web  burst  to  pass  through  the  bottleneck 
link  faster. 

Figure  5.11  compares  the  small  web  hie  delivery  time  under  different  schemes. 
Since  the  RED+Tail  policy  has  a  small  packet  loss  rate,  its  delivery  time  is  almost 
equal  to  the  loss  free  case  in  Table  5.1.  On  the  other  hand,  the  Adaptive  RED  has  a 
loss  rate  4.5%,  its  delivery  time  is  three  times  longer.  Note  that  the  drop-tail  queue 
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has  a  similar  loss  rate  (4.2%)  as  Adaptive  RED  for  web  packets.  However,  the  hie 
delivery  time  of  the  drop-tail  scheme  is  about  2.5  times  longer  than  Adaptive 
RED’s.  This  is  mainly  due  to  the  long  queuing  delay  (0.340sec)  of  the  drop-tail 
policy. 
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Figure  5.6:  Queue  lengths  of  RED+Tail  virtual  queues:  10  FTPs  starting  at  t=0 
go  to  virtual  queue  2,  and  800  WEBs  +  1  CBR  starting  at  t=100  go  to  virtual 
queue  1. 


117 


Adaptive  RED  RED+Tail 


Figure  5.7:  Packet  losses  (packets/sec.)  of  Adaptive  RED  and  RED+Tail. 


118 


Adaptive  RED  RED+Tail 


0_ 

H  0.5 
Ll_ 

0.5 

i  :  $  •  i ; ;  <  >  I  ;  ;  !■;  -i  •  ■ 

•  :  + 1. .  . ;  ■ 

0 

0 

0t - ■ - ■ -  o' - ■ - 

O  100  200  300  400  0  100  200  300  400 


400 


Figure  5.8:  Packet  delays  (sec.) 


of  Adaptive  RED  and  RED+Tail. 
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Figure  5.9:  Throughputs  (KBytes/sec.)  of  Adaptive  RED  and  RED+Tail. 
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Figure  5.10:  Accumulated  throughputs  of  Adaptive  RED  and  RED+Tail. 
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Figure  5.11:  Small  file  delivery  delay:  mean  and  standard  deviation. 
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5.5  Dynamic  Thresholds  for  Adaptive  RED 


The  original  Adaptive  RED  dynamically  adjust  the  maximum  dropping  probability 
Pmax  (or  equivalently,  the  slope  of  dropping  function)  to  control  the  flow  rates  of 
TCP  connections  and  keep  the  average  queue  length  in  a  desired  region.  However, 
for  those  applications  with  large  file  sizes,  the  goodput  is  more  important  than  the 
packet  delay.  The  packet  loss  rate  is  a  key  factor  in  determining  the  connection 
goodput.  Since  the  minimum  and  maximum  thresholds  of  the  original  Adaptive 
RED  scheme  are  fixed,  the  dropping  probability  of  Adaptive  RED  could  be  very 
high  when  a  congestion  happens.  This  high  dropping  probability  causes  frequent 
re- transmissions,  small  average  congestion  window  size  and  low  goodput.  In  other 
words,  the  congestion  in  bottleneck  router  causes  another  bottleneck  at  the  TCP 
sender  end.  Considering  that  the  Adaptive  RED  queue  is  designed  for  serving  time 
insensitive  connections,  we  propose  to  control  the  TCP  flow  rate  by  adjusting  its 
queuing  delay  instead  of  dropping  packets. 

To  maintain  a  low  packet  loss  rate  (and  a  large  average  congestion  window  size 
at  the  TCP  sender),  the  following  modified  Adaptive  RED  scheme  for  the  Adaptive 
RED  queue  is  proposed,  where  minth 2  and  maxth2  are  dynamically  adjusted  while 
D—maxth2  —  minth2  is  maintained  constant: 

•  Pick  0  <  7  <  1  (7=0.05  in  this  paper). 

•  If  Pd  >  Pjj,  then  minth2  :=  minth2(l  +  7),  maxth2  :=  minth2  +  D. 

•  If  Pd  <  Pl ,  then  minth2  :=  minth2{  1  —  7),  maxth2  :=  minth2  +  D, 

where  Pd  is  the  average  dropping  probability  obtained  by  the  EWMA  algorithm 
and  ( Pl ,  Pu )  is  the  desired  region  of  dropping  probability.  Note  that  if  we  set 
Pu  <  Pmax,  the  floating  thresholds  do  not  change  the  current  slope  of  dropping 
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probability  function  dramatically,  since  the  distance  between  the  thresholds  is 
fixed. 

The  rationale  behind  the  above  scheme  is  that,  by  increasing  the  thresholds 
(when  Pd  >  Pjj),  the  queuing  delay  is  increased  and  the  flow  rates  are  reduced. 
Since  the  average  TCP  throughput  [96]  [97]  is  proportional  to  BTB^pr,  we  achieve 
the  same  throughput  without  raising  the  packet  loss  rate.  Figures  5.12  and  5.13 
compare  the  Adaptive  RED  schemes  with  fixed  and  dynamic  thresholds  respec¬ 
tively.  There  are  20  persistent  FTP  servers  sharing  a  6 Mbps  bottleneck  link. 
Another  20  FTP  servers  arrive  at  time  100s  and  leave  at  time  300s.  It  can  be 
seen  that  the  fixed  threshold  scheme  has  a  small  qnene  length  variation  and  a 
large  dropping  probability  (0.05).  In  contrast,  the  dynamic  threshold  scheme  has 
a  much  lower  average  dropping  probability  (0.014  with  PL= 0.01,  Pu— 0.02),  but 
a  higher  packet  delay.  Note  that  both  schemes  achieve  100%  link  utilization  so 
that  each  FTP  connection  has  the  same  throughput.  However,  with  a  much  lower 
packet  loss  rate,  the  dynamic  threshold  scheme  achieves  a  higher  goodput.  This 
dynamic  threshold  scheme  allows  us  to  consider  the  trade-off  between  packet  loss 
and  queuing  delay  in  an  Adaptive  RED  queue. 

Dynamically  varying  the  thresholds  may  also  have  implications  in  achieving 
throughput  fairness  among  multiple  Adaptive  RED  queues.  Since  the  flow  rates  of 
TCP  connections  are  determined  by  the  corresponding  dropping  probabilities  and 
queuing  delays  at  different  queues,  connections  with  shorter  link  propagation  delays 
and  higher  throughputs  can  be  suppressed  by  raising  the  queue  length  thresholds 
at  the  router. 
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Fixed  Thresholds 


Figure  5.12:  Average  queue  length  with  fixed  and  dynamic  thresholds:  20  FTP 
starting  at  t=0,  and  another  20  FTP  starting  at  t=100s  and  leaving  at  t=300s, 
C=6Mbps,  d/c= 64ms. 
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Fixed  Thresholds 


Figure  5.13:  Dropping  probability  with  fixed  and  dynamic  thresholds:  20  FTP 
starting  at  t=0,  and  another  FTP  20  starting  at  t=100s  and  leaving  at  t=300s, 
C=6Mbps,  dk= 64ms  (Inst.  P:  instantaneous  dropping  probability;  Avg.  P: 
EWMA  average  of  Inst.  P). 
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Table  5.4:  Performance  Metrics :Red+ Tail  with  Dynamic  threshold  scheme 


Policy 

Loss  % 

Delay  Sec. 

Rate  KB/s 

Dyn.  Thres.:FTP 

0.899 

0.318 

209.455 

Dyn.  Thres.:WEB 

2.306 

0.093 

144.505 

Dyn.  Thres.:CBR 

0.519 

0.091 

19.827 

The  ns2  simulation  in  Section  5.4  is  conducted  again  with  the  modified  Adap¬ 
tive  RED  serving  the  second  virtual  queue.  Parameters  (except  minth2  and 
maxth2 ,  which  are  dynamically  adjusted)  used  are  as  listed  in  Table  5.2.  The 
desired  region  of  dropping  probability  for  the  Adaptive  RED  queue  is  set  to  be 
(Pl,Pu)—(Q- 005, 0.010).  Figure  5.14  shows  the  lengths  of  both  virtual  queues  and 
the  dropping  probability  at  the  Adaptive  RED  queue.  The  dropping  probability 
stays  in  the  desired  region  most  of  the  time  as  expected.  Note  that  the  flow  rate 
of  FTP  connections  are  reduced  without  increasing  the  queue  length  q2(t)  and  the 
dropping  probability  dramatically  when  the  bursty  web  traffic  arrives  at  f=100. 
This  is  because  that  the  available  bandwidth  for  FTP  connections  is  reduced  and 
FTP  senders  see  a  longer  round-trip  time  (longer  packet  delay  at  q2,  see  Figure 
5.16). 

Figures  5.15  and  5.16  show  the  packet  losses  and  delays  for  FTP,  web  and 
CBR  connections  respectively.  Table  5.4  collects  the  corresponding  performance 
metrics.  Comparing  to  Figure  5.7,  5.8  and  Table  5.3,  the  packet  loss  rate  of  FTP 
connection  is  reduced  from  2.747%  to  0.988%  at  the  cost  of  packet  delay  (increased 
from  0.184s  to  0.318s).  Since  the  average  queue  length  at  the  Adaptive  RED  queue 
is  about  80 KBytes  instead  of  60KBytes  in  the  fixed  threshold  scheme,  web  and 
UDP  packets  see  a  smaller  shared  buffer  at  the  drop-tail  queue  and  experience  a 
higher  loss  rate  from  1.278%  to  2.306%  and  from  0.300%  to  0.519%,  respectively. 
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This  situation  can  be  improved  by  increasing  the  total  buffer  size  despite  that  this 
approach  is  general  useless  or  even  harmful  in  a  single  RED  queue  or  a  single  drop- 
tail  queue.  On  the  other  hand,  the  average  delays  of  web  and  UDP  packets  are 
slightly  shorter  for  a  smaller  shared  buffer  space  at  the  drop-tail  queue.  Finally, 
Tables  5.3  and  5.4  also  show  that  the  throughputs  for  the  fixed  threshold  scheme 
and  the  dynamic  threshold  scheme  are  almost  the  same. 
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Figure  5.14:  Dynamic  threshold  scheme:  Virtual  queue  lengths  of  RED+Tail  and 
dropping  probability  of  the  Adaptive  RED  queue,  10  FTPs  starting  at  t=0  and 
800  WEBs  +  1  CBR  starting  at  t=100. 
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Figure  5.15:  Dynamic  threshold  scheme:  Packet  losses  (packets/sec.)  of 


RED + Tail. 
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Figure  5.16:  Dynamic  threshold  scheme:  Packet  delays  (sec.)  of  RED + Tail. 


131 


5.6  Summary 


In  this  chapter,  we  first  demonstrated  the  vulnerability  of  Adaptive  RED  scheme 
to  bursty  traffic  and  then  proposed  a  parallel  virtual  queue  structure  to  eliminate 
unnecessary  packet  loss.  A  simple  detection  algorithm  is  employed  to  separate 
the  short-life  and  long-life  TCP  connections  into  different  virtual  queues.  The 
packet  loss  rate  and  mean  delay  can  be  greatly  reduced  by  dynamic  bandwidth 
allocation  and  active  queue  management  with  a  parallel  queue  structure.  This 
scheme  combines  the  advantages  of  drop-tail  and  Adaptive  RED  policies.  The 
simulation  results  in  the  study  show  that  this  scheme  achieves  a  shorter  mean 
delay  for  real  time  applications  and  keeps  a  high  throughput  for  the  best  effort 
connections  as  well  as  greatly  reduces  the  packet  loss  rate  in  both  queues. 

This  parallel  queue  structure  also  provides  more  degree  of  freedom  to  con¬ 
trol  the  router  by  considering  different  bandwidth  allocation  policies  and  dynamic 
thresholds  for  Adaptive  RED.  Here,  the  bandwidth  allocation  policy  is  a  simple 
function  of  the  current  virtual  queue  length.  However,  it  is  well-known  that  web 
traffic  is  strongly  correlated  and  has  a  long  range  dependency  property.  Based  on 
observations  of  the  ’’recent  past”  traffic,  the  future  bandwidth  demand  of  the  web 
traffic  was  shown  to  be  predictable.  In  future  work,  we  will  consider  the  optimal 
bandwidth  allocation  policy  based  on  the  prediction  of  congestion  level. 
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Chapter  6 


Performance  Analysis  of  Active 
Queue  Management  in  a  Parallel 
Queue  Structure 
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Joint  analysis  of  the  parallel  queue  is  involved.  Instead  the  queues  are  analyzed 
separately  in  Section  6.1  and  6.2,  respectively.  For  the  drop-tail  queue,  the  empha¬ 
sis  is  put  on  packet  delay;  for  the  Adaptive  RED  queue  (with  dynamic  thresholds), 
the  stability  of  queue  length  is  the  primary  concern. 


6.1  Drop- Tail  Queue  with  Adaptive  Service  Rate 


First,  we  investigate  the  queuing  delay  of  CBR  and  web  traffic  at  the  drop-tail 
queue.  Note  that  the  service  rate  C\  (t)  of  this  queue  is  a  function  of  minthi , 
maxthi  and  the  current  length  of  drop-tail  queue  qi(t)  (Figure  6.1).  For  ease 
of  presentation,  we  let  minthi  =  0  and  C imax  =  C.  When  a  packet  enters  the 
drop-tail  queue  at  time  t,  it  sees  an  instant  queue  length  qi(t)  and  a  service  rate 


n  ,.S  A  .  ,  qi(t)C 
ClW  =  mm(t^rtVC)' 


(6.1) 


General  analysis  of  mean  packet  delay  is  difficult.  However,  the  dynamic  band¬ 
width  C\  (t)  is  updated  periodically  in  both  practice  and  simulation.  If  we  assume 
that  the  update  period  is  S  —  ma^hl  and  the  packets  belonging  to  the  same  slot 
have  uniformly  distributed  arriving  times,  there  exists  a  simple  lower  bound  for  the 
mean  packet  delay.  The  impulse  at  t  =  iS  in  Figure  6.2  denotes  the  new  packets 
that  arrive  uniformly  in  the  time  period  [(i  —  1  ),iS).  The  absolute  value  of  the 
slope  indicates  the  bandwidth  Ci(iS),  which  is  dependent  on  qi(iS).  If  qi(iS)  is 
less  than  maxthi ,  Ci(iS)  =  •  It  is  evident  that  each  packet  in  this  slot  has 

a  constant  delay  maxthi/ C.  If  qi(iS)  >  maxthi ,  Ci[iS )  =  C  and  the  average 
packet  delay  in  this  slot  is  larger  than  maxthi/ C.  Although  the  bound  ma//hl  is 
derived  based  on  special  assumptions,  it  appears  to  provide  a  good  approximation 
to  the  lower  bound  of  mean  packet  delay  for  general  cases  (refer  to  Figure  6.3). 
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Figure  6.1:  Dynamic  bandwidth  allocation  at  the  drop-tail  queue: 

Ci(t)=max(  Cimin,  min(  ,  Clmax)). 

The  experiment  in  Section  5.4  is  re-conducted  with  parameters  listed  in  Table 
5.2  except  maxthi  being  varied  from  10 KB  to  80 KB.  Figure  6.3  shows  the  mean 
packet  delay  of  CBR  and  web  traffic  at  the  drop-tail  queue  and  that  of  FTP 
traffic  at  the  Adaptive  RED  queue  as  maxthi  is  varied.  Note  that  the  packet 
delay  at  the  Adaptive  RED  queue  with  fixed  thresholds  is  almost  a  constant  even 
when  maxthi  is  decreasing.  That  is  because  the  Adaptive  RED  queue  has  fixed 
thresholds  (■ minth2 ,  maxthi)  and  the  average  queue  length  of  RED  queue  is  around 
q2—(minth2+maxth2) /2.  Let  Cj  denote  the  average  bandwidth  for  the  first  queue. 
Then  the  mean  delay  at  the  Adaptive  RED  queue  is  around  c^c  ■ 

For  some  real  time  applications  such  as  video  conference  and  voice,  small  delay 
jitter  is  very  important  for  the  connection  quality.  Figure  6.3  also  shows  that  the 
drop-tail  queue  has  a  very  small  delay  variance.  Note  that  the  delay  variance  at 
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Figure  6.2:  Drop-tail  queue  length  with  a  time  varying  bandwidth  C\  (t). 

the  Adaptive  RED  qnene  is  slightly  increased  when  a  smaller  value  of  maxthi  at 
the  drop-tail  queue  is  applied.  According  to  these  results,  the  mean  packet  delay 
requirements  at  both  queues  can  be  satisfied  by  properly  designing  the  values  of 
(minthi,  maxthi)  and  (minth 2,  maxthi). 
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CBR  &  WEB  at  Drop-tail  Queue 


Figure  6.3:  Mean  delays  (sec.)  of  CBR  and  WEB  packets  at  the  drop-tail  queue 
and  mean  delay  of  FTP  packets  at  the  Adaptive  RED  queue  (with  fixed  thresholds) 
with  maxthi—  10,  20,. ..,80  (KBytes). 
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6.2  Adaptive  RED  Queue  with  Dynamic  Thresh¬ 
olds 

In  Chapter  5  we  proposed  the  modified  Adaptive  RED  scheme  with  dynamic 
thresholds  in  the  parallel  queue  structure  for  controlling  the  flow  rate  of  non- 
real  time  applications.  The  maximum  threshold  maxt}i2  and  minimum  threshold 
minth2  are  changed  dynamically  to  keep  the  packet  dropping  probability  Pd  within 
a  desired  small  region  (Pi,  Pu)  at  the  cost  of  packet  delay  variation.  In  this  sec¬ 
tion  we  analyze  issues  related  to  the  stability  of  this  virtual  queue.  For  ease  of 
analysis,  it  is  assumed  that  the  dropping  probability  Pd  of  the  Adaptive  RED  at 
the  bottleneck  router  is  fixed  so  that  the  average  flow  rate  of  each  TCP  connection 
can  be  approximated  by  a  simple  function  of  its  round-trip  time.  Note  that  this 
assumption  is  not  very  restrictive  considering  the  interval  ( Pl ,  Pjj)  is  small. 

Consider  N  persistent  TCP  flows.  To  simplify  analysis,  it  is  assumed  that  the 
service  rate  C2  of  the  Adaptive  RED  queue  is  constant,  C2  —  C  —  C'i,  where  C\ 
is  the  average  bandwidth  for  the  drop-tail  queue.  Define  Tk  as  the  average  flow 
rate  of  the  kth  TCP  connection  during  time  slot  n.  Let  d'k  be  the  link  round- 
trip  propagation  delay  of  connection  k.  At  the  beginning  of  time  slot  n  the  kth 
connection  sees  a  round-trip  time  Rk,  which  is  equal  to  the  sum  of  link  propagation 
delay  and  the  average  queuing  delay  in  the  forward  direction  qn/C2  and  in  the 
backward  direction  qfi / C2 : 

=  *  +  £  +  £,  (6.2) 

where  qn  and  q£  are  the  forward  queue  length  and  the  backward  queue  length  (of 
this  Adaptive  RED  queue)  at  the  beginning  of  time  slot  n,  respectively.  We  assume 
that  congestion  only  happens  in  the  forward  direction  and  the  queuing  delay  q^  /C2 
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in  the  backward  direction  is  a  constant.  Hence  we  can  write  Rk  =  dk  +  qn  /C2  with 

4  =  4  +  Qb/C 2- 

Based  on  the  assumption  of  fixed  dropping  probability  at  the  router,  each  TCP 
connection  experiences  a  fixed  packet  loss  rate  Pd  and  the  corresponding  average 
congestion  window  size  is  assumed  to  be  a  constant  W .  Hence,  the  average  flow 
rate  Tk  of  the  kth  TCP  connection  at  slot  n  is 

T£  =  Z;  +  Enk  (6.3) 

where  Ek  is  a  white  Gaussian  process  with  zero  mean  and  variance  a2  modeling 
the  flow  rate  perturbation  of  the  kth  connection  at  slot  n. 

Given  the  arrival  rate  of  each  TCP  connection,  the  dynamics  of  queue  length 
qn  follows  the  Bindley  equation: 

N 

qn+ 1  =  min{B,  max[ 0,  qn  +  T”  -  C2)^]},  (6.4) 

k= 1 

where  B  is  the  buffer  size  and  S  is  the  duration  of  one  time  slot.  We  list  the 
parameter  definitions  as  follows: 

•  TJ!  :  average  flow  rate  of  TCP  connection  k  at  time  slot  n 

•  :  perturbation  of  flow  rate  (modeled  by  a  white  Gaussian  process  </P(0,  a2) 

•  W :  average  congestion  window  size 

•  C2  '■  link  bandwidth 

•  dk  :  link  round-trip  propagation  delay  d!k  +  backward  queuing  delay  qb/C2 

•  qn\  forward  link  queue  length  at  the  beginning  of  time  slot  n 

•  S  :  duration  of  one  time  slot 
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•  B  :  buffer  size 


•  N  :  number  of  TCP  connections 

Since  the  queue  length  of  Adaptive  RED  is  mostly  operated  in  a  region  far 
from  the  boundary,  we  first  ignore  the  max  and  min  operations  in  (6.4)  and  have 
a  simplified  nonlinear  dynamic  system: 


9”+1  =  f(q")+C, 


(6.5) 


where 


and 


N 

f(qn)=qn  +  S{(J2 

k=  1 


WC2 
qn  +  dkC2 


)  - 


(6.6) 


N 

e=sJ2Ek-  (6.7) 

k= 1 

To  avoid  the  trivial  case  q  =  0,  we  assume  that  the  sum  of  possible  peak  rates 
of  all  connections  is  greater  than  the  link  bandwidth  at  the  bottleneck  router: 


Figure  6.4:  Queue  length  and  TCP  throughput  (of  a  single  connection)  with 
CY2=6Mbps,  dk= 64ms,  hF=6.02xl04  bits.  Compare  with  simulation  in  Fig. 5. 12. 
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Figure  6.5:  Mapping  functions  and  equilibrium  points  when  N=20,  40  with 
S=RTT. 
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Figure  6.4  shows  the  queue  length  dynamics  (and  the  throughput  of  a  persistent 
TCP  connection)  based  on  the  model  (6.5),  where  the  flow  rate  deviations  a  = 
77021,  128490  (bits/s)  for  1V=20,  40  are  measured  from  the  simulation  in  Chapter 
5,  respectively.  For  both  N  —  20  and  N  =  40,  Figure  6.4  shows  consistent  steady 
state  behavior  with  simulation  results  in  Figure  5.12.  The  mapping  /(•)  is  plotted 
in  Figure  6.5  for  IV  =  20  and  N  =  40. 

We  first  analyze  the  stability  of  the  equilibrium  of  the  model  (6.5)  when  there 
is  no  flow  disturbance.,  i.e.,  E%  =  0.  An  equilibrium  qe  of  qn+1  =  f(qn )  should 
satisfy 


N 

£ 


W 


=  a. 


(6.9) 


^  dk  +  Qe/Cz 

Now,  we  show  the  mapping  function  /  is  strictly  convex  and  qe  is  the  unique 
solution.  According  to  (6.6),  we  have  the  first  and  second  derivatives  of  /: 

f\q )  =  1  +  SWC2  £f=1  <  1,  V  0  <  q  <  oo  (6.10) 

/"(<?)  =  2AWC2  £f=1  >  0,  V  0  <  g  <  oo.  (6.11) 


Note  that  /,  f'  and  f"  are  continuous  and  differentiable  for  all  0  <  q  <  oo. 


Lemma  6.2.1  /  is  strictly  convex  for  all  q  >  0. 

Proof  Since  c4  >  0,  according  to  (6.11)  we  have  f"  >  0  for  allO  <  q  <  oo,  which 
indicates  that  f  is  a  strictly  convex  function. 

Equation  (6.10)  indicates  that  the  slope  of  /  is  less  than  1  for  all  0  <  q  <  oo. 
Lemma  6.2.2  qe  is  unique  in  the  region  (0,  oo). 

Proof  Since  Ylk=i  —  ^2  by  assumption,  (6.9)  has  a  solution  qe  in  [0,oo).  qe 
is  located  at  the  intersection  of  the  graph  of  f  with  the  45°  line  (see  Figure  6.5). 
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Furthermore,  /'  <  1  implies  there  is  no  other  intersection  in  (ge,  oo)  and  shows  q, 
is  unique  in  the  region  (0,  oo). 


It  is  well  known  that  qe  is  locally  asymptotically  stable  if  |/'(ge)|  <  1-  hi  the 
following  we  give  conditions  for  qe  to  be  globally  asymptotically  stable. 


Proposition  6.2.3  If  the  rate  update  interval  S  satisfies 

9C! 

S  <  - — ^ ,  (6.12) 

42) 

the  equilibrium  qe  is  globally  asymptotically  stable.  Furthermore,  \qn  —  qe\  <  pn\q°  — 
qe  |  for  some  p  G  (0, 1)  dependent  on  q°. 

Proof  First  we  observe  that  the  function  /  is  convex  since 

N 


f"(q)  =  7~r~ 7  7W4  >  °>  Vg  G  [0,  oo). 


t!  (9  +  d kC2) 

For  any  B0  such  that  B0  >  qe  and 


(6.13) 


"  w 


Bo  >  m  =£7*  C9S. 


(6.14) 


k= 1 


one  can  verify  that  /  maps  [0,  B0]  to  [0,  B0]  due  to  convexity  of  /  and 


N 


rtq)  =  i-sj2 


WC2 


<1,  Vg  G  [0,  oo). 


(6.15) 


(q  +  dkc2y 

When  restricted  to  [0,  B0\,  /'(g)  <  pi  with  p1  G  (0, 1).  If  (6.12)  is  satisfied,  /'(g)  > 
-1,  Vg  G  [0,  oo],  and  /'(g)  >  -p2,  Vg  G  [0,  B0],  with  p2  G  (0, 1). 

Hence  |/'(g)|  <  p  =  max(pi ,  p2)  <  1  Vg  G  [0,  Bo],  which  implies  that  /  is  a 
contraction  mapping  on  [0,  _B0].  By  the  Contraction  Mapping  Principle  [98], 


\qn  -  qe\  <  Pn\q°  ~  qe\,  if  q°  G  [0,  B0\. 


(6.16) 
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Since  B0  can  be  arbitrarily  large  (as  long  as  B0  <  oo),  qe  is  globally  asymptotically 
stable.  Note  that  the  contraction  constant  p  depends  on  B0  and  thus  on  q°.  □ 
From  Proposition  6.2.3,  when  rate  update  is  frequent  enough,  the  equilibrium 
will  be  asymptotically  stable  (the  equilibrium  itself  does  not  depend  on  S' ).  An¬ 
other  sufficient  condition  for  asymptotic  stability  is  the  following: 

Proposition  6.2.4  If  f'(qe)  >  0,  then  qe  is  a  globally  asymptotically  stable. 
Proof  As  shown  in  the  proof  of  Lemma  6.2.1,  f  is  strictly  convex.  If  f'(qe)  >  0, 
graphical  analysis  revels  that 

k"+'-9e|<k"-9e|, 

where  the  equality  holds  if  and  only  if  qn  =  qe.  The  claim  thus  follows.  □ 

For  the  homogeneous  case  c4  =  d,  we  have  qe  =  NW  —  dC2.  And  the  condition 
f'(qe)  >  0  is  equivalent  to  S'  <  -^P-  =  qe/C2+d.  In  other  words,  qe  is  asymptotically 
stable  if  the  rate  update  interval  S  is  no  larger  than  the  round-trip  time  (RTT). 
Figure  6.10  shows  the  mapping  /  and  the  equilibrium  qe  for  different  S.  Figure 
6.7  shows  the  queue  length  dynamics  (noise  is  included)  for  S=RTT  and  2 RTT, 
respectively.  We  can  see  that  in  the  case  S=RTT ,  the  queue  length  stays  around 
qe  with  small  variation,  while  in  the  case  S=2RTT ,  the  queue  length  dynamics  is 
much  more  chaotic. 

For  the  heterogeneous  case,  a  sufficient  condition  S'  <  )-1  for 

stability  can  be  derived  as  follows: 

Proposition  6.2.5  For  any  dk  >  0,  the  system  is  stable  if  S  <  (^  — 

Proof  According  to  Proposition  6.2.  f,  we  want  to  find  the  condition  of  S  so  that 
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f  '(qe )  >  0  is  satisfied.  Let  Dk  dkC2  and  G  :=  WC2,  from  (6.9)  we  have: 


E1  _  ^2 

k=i  G 

from  (6.10),  we  want  to  show  f  \qe )  >  0  by  showing  that 


V  >  1  1 

h  We  +  Dk)2  -  SG ■ 

Substitute  (6.17)  in  (6.18),  we  have 

Cl  ^  1  1  J_ 

G2  qe  +  Di  qe  +  Dj  S G 

*7=7 


(6.17) 


(6.18) 


(6.19) 


Since  qe>  0  and  Dk  >  0,  a  sufficient  condition  of  the  above  inequality  is: 


Cl 

G2 

^S  <  ( 


N(N-  1)  ^  1 
'  (qe  +  Dm f  ~  SG 
C2  N(N  —  1)  ! 

W  ( qe  +  Dm)2)  ’ 


(6.20) 

(6.21) 


where  Dm  :=  ruin  Dk .  Hence  S  <  ^2 )  1  is  a  sufficient  condition  for 

stability.  □ 


Similar  to  the  homogeneous  case,  if  the  rate  update  interval  S  is  short  enough, 
we  can  guarantee  the  stability  in  the  heterogeneous  case. 
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Figure  6.6:  Mapping  function  and  equilibrium  point  when  N=40  with  S=0.5RTT, 
1RTT  and  2RTT. 
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Figure  6.7:  Queue  length  with  N=40,  S=RTT  and  2RTT. 
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Next  we  consider  the  Lindley  equation  with  finite  random  perturbation  = 

SThli?: 

9,,+1  =  9(qn, C)  =  min{s,  max[0,  /(«“)  +  {’*]}  (6.22) 

Note  that  since  {E'f}  is  white  and  stationary,  so  is  {£"}.  It  turns  out  that  stability 
of  the  equilibrium  of  the  deterministic  system  qn+1  =  f(qn)  is  closely  related  to 
stochastic  stability  of  the  system  (6.22). 

Define  a  compact  set  X:  =  [0,  B ]  and  the  transformation  g: X  xW  — »  X  of  (6.22) 
is  called  regular  stochastic  dynamic  system  in  [99],  if  the  following  assumptions 
hold. 

(a)  The  random  vectors  £°,  ...,  have  value  in  W  and  have  the  same  distribution. 

(b)  g  is  defined  on  the  subset  XxW  of  RdxRk.  The  set  X  C  Rd  is  closed  and 

W  C  Rk  is  Borel  measurable.  For  every  fixed  yEW  the  function  g(x,y)  is 
continuous  in  x  and  for  every  fixed  xEX  it  is  measurable  in  y. 

(c)  The  initial  random  vector  a;0  has  initial  value  in  A" and  the  vectors  x°,  £°,  ^1, ..., 

are  independent. 

Definition  6.2.6  [99]  (Foias  operator^)  Let  a  function  g:X  x  W  satisfies  condi¬ 
tion  (b)  and  a  probabilistic  measure  (supported  onW )  be  given.  Then  the  operator 
P:  Mfin  ->  Mfin  given  by 

P/i(A)  =  [  {[  1  A(g(x,y))u(dy)}fi(dx)  (6.23) 

Jx  Jw 

will  be  called  the  Foias  operator  corresponding  to  the  dynamic  system.  Mfin 
denotes  the  subspace  of  finite  measures.  □ 
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Theorem  6.2.7  [99]  Krylov -Bogolubov  Theorem  Let  P  be  the  Faias  operator 
to  a  regular  stochastic  dynamical  system.  Assume  that  there  is  a  p$  E  Mi  having 
the  following  property.  For  every  e  >  0  there  is  a  bounded  set  B  e  B(X)  such  that 

Pn(B)  =  Pnp0(B)  >  1  -  £  for  n  =  0, 1,  2, ....  (6.24) 

Then  P  has  an  invariant  distribution.  □ 

Proposition  6.2.8  The  stochastic  system  (6.22)  admits  an  invariant  probability 
measure  p*  for  the  queue  length  qn.  Furthermore,  if  the  condition  (6.12)  on  Propo¬ 
sition  6.2.3  is  satisfied,  this  system  is  weakly  asymptotically  stable,  i.e.,  the  queue 
length  distribution  pn  for  qn  converges  to  p*  weakly. 

Sketch  of  Proof.  Since  f  is  continuous  and  {£n}  is  identically  and  independently 
distributed,  the  system  (6.22)  is  a  regular  stochastic  dynamic  system. 

Since  [0,  B]  is  compact,  the  system  admits  an  invariant  probability  measure  p* 
by  the  Krylov -Bogolubov  Theorem.  When  condition  (6.12)  is  satisfied,  g  is  a 
contraction  mapping  with  respect  to  its  first  argument,  i.e., 

\g(x,0  -  g(y,0\ <  p\%  -  y\>  Vx,ye[ o,£],Vf,  (6.25) 

where  p  G  (0,1).  Flence  the  system  is  weakly  asymptotically  stable  by  Theorem 
12.6.1  of  [99].  □ 

The  invariant  probability  measure  p*  has  probability  masses  at  q  —  0  and 
q  =  B,  and  has  probability  density  on  (0 ,B).  An  approximation  to  p*  can  be 
obtained  by  numerically  advancing  the  probability  distribution  pn  for  the  queue 
length  qn.  We  have  discretized  the  queue  length  and  consequently  obtained  a 
Markov  chain  for  the  dynamics  of  the  queue  length  distribution. 

Let  the  packet  size  have  a  fixed  length  L  (bits),  zn:—ceil{qn / L)  be  the  number 
of  packets  in  the  queue  at  time  n  and  7 xn  =  [ Pr(zn  =  0),  ...,Pr(zn  =  B)}  denote 
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the  corresponding  probability  vector.  We  have 

7 Tn+1  =  7 rnT,  (6.26) 

Ti*  =  t t*T,  (6.27) 

where  7r*  =  lim^^  7r"  is  the  steady  state  distribution  and 

T(i,j)  :=  Pr[zn+1  =  j\zn  =  i]  (6.28) 

is  the  corresponding  transition  matrix  of  the  Markov  chain.  The  conditional  prob¬ 
ability  Pr[zn+1  =  j\zn  =  i]  is  obtained  as 

Pr\j  <  (min{B,max[ 0,  f(iL )  +  £]})/L  <  (j  +  1)].  (6.29) 

On  the  other  hand,  when  the  buffer  size  B  is  far  greater  than  the  equilibrium 
queue  length  and  the  perturbation  magnitude  is  small,  the  transformation  g(q,£) 
can  be  linearized  around  the  equilibrium  point  qe.  Let  Qn  =  qn  —  qe.  Then 

Qn+1=  f'(qe)Qn  +  C ■  (6.30) 

Since  {£n}  is  white  Gaussian  process  with  zero  mean  and  variance  NSa 2,  {Qn} 
will  be  a  Gaussian  process  with  zero  mean  and  normalized  variance 

Var[QVS]  =  TT7TF-  (6'31) 

From  (6.31)  the  normalized  queue  length  variation  will  be  minimal  if  f  '(qe )  =  0, 
which  corresponds  to  S  —  RTT  for  the  homogeneous  case. 

Figure  6.8  and  6.9  show  the  queue  length  distributions  obtained  through  empir¬ 
ical  estimation  from  ns2  simulation,  numerical  computation  based  on  (6.26),  and 
linear  approximation  based  on  (6.31),  repectively.  Three  distributions  agree  well, 
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which  verifies  that  our  nonlinear  model  (6.22)  captures  the  queue  length  dynamics 
under  the  Adaptive  RED  scheme  with  dynamic  thresholds. 

Figure  6.10  6.11  and  6.12  illustrate  the  Mapping  functions,  equilibrium  points, 
queue  length  dynamics  from  N  =  20,  40  to  8. 
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Figure  6.8:  Steady  state  queue  length  distributions  for  N=20,  S=RTT. 
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Figure  6.9:  Steady  state  queue  length  distributions  for  N=40,  S=RTT. 
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Figure  6.10:  Mapping  function  and  equilibrium  point  when  N=20,  40  and  8  with 
S=0.5RTT,  1RTT  and  2RTT. 
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Figure  6.11:  Queue  length  dynamics  N  =  20  — >  40  — >  8. 
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Figure  6.12:  Queue  length  with  N  =  20  — >  40  — >•  8. 
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6.3  Summary 


In  Chapter  5  and  6  we  have  first  demonstrated  the  vulnerability  of  Adaptive  RED 
scheme  to  bursty  web  traffic,  and  then  proposed  a  parallel  virtual  queue  structure 
for  active  queue  management  at  the  router.  A  simple  detection  algorithm  is  em¬ 
ployed  to  separate  the  short-life  and  long-life  TCP  connections  into  different  virtual 
queues.  The  packet  loss  rate  and  mean  delay  for  short-life  traffic  can  be  greatly 
reduced  by  dynamic  bandwidth  allocation  with  this  parallel  queue  structure.  This 
scheme  combines  the  advantages  of  drop-tail  and  Adaptive  RED  policies.  The 
simulation  results  in  the  study  show  that  this  scheme  achieves  a  shorter  mean 
delay  for  real  time  applications  and  keeps  a  high  throughput  for  the  best  effort 
connections  as  well  as  greatly  reduces  the  packet  loss  rate  in  both  queues. 

This  parallel  virtual  queue  structure  also  offers  more  degrees  of  freedom  for 
AQM  due  to  its  flexibility  in  accommodating  variants  of  the  Adaptive  RED  scheme 
and  different  dynamic  bandwidth  allocation  algorithms.  We  have  explored  a  mod¬ 
ified  Adaptive  RED  scheme  with  sliding  queue  length  thresholds.  This  scheme 
is  able  to  maintain  the  dropping  probability  within  a  small  interval  and  improve 
the  goodput  of  non-real  time  connections.  The  queue  length  variation  under  this 
policy  has  been  analyzed  and  conditions  for  its  stability  have  been  given.  The 
dynamic  threshold  Adaptive  RED  might  also  be  useful  for  achieving  throughput 
fairness  among  multiple  RED  queues. 

As  to  the  dynamic  bandwidth  allocation  policy  for  the  drop-tail  queue,  we  only 
used  the  current  virtual  queue  length  information.  However,  it  is  well-known  that 
web  traffic  is  strongly  correlated  and  has  a  long  range  dependency  property.  Based 
on  observations  of  the  “recent  past”  traffic,  the  future  bandwidth  demand  of  the 
web  traffic  is  predictable.  In  future  work  optimal  bandwidth  allocation  based  on 
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prediction  of  the  congestion  level  will  be  explored. 
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Chapter  7 


Conclusions 


In  this  dissertation,  we  first  reviewed  the  theory  of  wavelet  analysis  for  network 
traffic.  We  also  discussed  the  monofractal  and  multifractal  behaviors  of  Internet 
traffic  at  large  and  small  time  scales.  In  order  to  capture  these  mono/multi-fractal 
properties  at  all  time  scales,  we  proposed  a  multilevel  ON/OFF  model  for  the 
Internet  traffic  and  developed  an  algorithm  to  estimate  the  model  parameters  from 
a  real  trace.  The  idea  of  this  model  is  to  imitate  the  TCP  packet  arrival  pattern  at 
the  lower  level  and  the  connection  arrival  pattern  at  the  upper  level.  A  synthetic 
traffic  was  generated  by  the  proposed  model  with  the  parameters  estimated  from 
a  real  trace.  The  wavelet  analysis  showed  that  this  new  model  can  successfully 
capture  the  statistical  properties  in  second  order  (Logscale  diagram)  and  higher 
orders  (multifractal  spectrum).  The  simulation  results  also  showed  that  these  two 
traffic  share  the  same  queuing  behavior. 

Since  the  Logscale  diagram  carries  important  statistical  properties  of  the  traf¬ 
fic  at  all  time  scales,  we  developed  an  approximation  of  Logscale  diagram  of  the 
Internet  traffic.  The  goal  is  to  compute  the  Logscale  diagram  directly  from  the 
network  parameters  of  the  model  instead  of  analyzing  a  long  trace.  Given  the 
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Logscale  diagram  of  the  input  traffic,  one  can  immediately  transfer  the  Logscale 
diagram  to  the  traffic  workload  distribution  at  all  time  scales  and  predict  the  cor¬ 
responding  queuing  behavior  and  network  performance.  Furthermore,  our  analysis 
results  show  that  the  queuing  behavior  with  a  small  buffer  size  is  dominated  by  the 
traffic  behaviors  at  small  time  scales.  It  indicates  that  the  traditional  monofractal 
(self-similar)  traffic  model  is  not  suitable  for  predicting  the  performance  in  real 
networks. 

For  some  real  time  applications  such  as  Constant  Bit  Rate  (CBR)  connections 
are  sensitive  to  the  mean  and  variance  of  packet  delay.  By  applying  our  predicted 
results  of  queue  length  distribution  and  the  properties  of  wavelet  analysis,  we 
developed  a  fast  algorithm  to  estimate  the  mean  and  variance  of  packet  delay 
in  real  time.  This  tool  could  help  the  network  resource  providers  decide  how 
much  bandwidth  should  be  allocated  to  guarantee  the  quality  of  service  for  certain 
applications. 

In  the  second  part  of  this  thesis,  we  are  focusing  on  the  policy  of  buffer  manage¬ 
ment.  Many  Active  Queue  management  schemes  such  as  RED  were  proposed  to 
improve  the  TCP  throughput  and  link  utilization.  The  basic  assumption  of  these 
policies  is  assuming  that  the  TCP  is  operated  in  the  Congestion  Avoidance  phase. 
However,  the  current  Internet  traffic  is  dominated  by  web  traffic  and  most  web 
connections  are  operated  in  the  slow  start  phase.  We  indicated  that  these  AQM 
policies  implemented  in  a  shared  queue  is  not  suitable  for  the  current  Internet  traf¬ 
fic.  The  simulation  results  demonstrated  that  the  web  traffic  severely  degrades  the 
performance  of  those  AQM  schemes.  Thus,  we  proposed  a  parallel  queue  structure 
for  a  better  solution  of  buffer  management.  Since  TCP  connection  is  untamed  in 
the  slow  start  phase,  we  invented  a  detection  algorithm  to  prevent  the  untamed 
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TCP  from  entering  into  the  AQM  queue.  In  addition,  the  parallel  structure  also 
gives  us  more  freedom  to  control  the  AQM  scheme.  A  typical  AQM  scheme  such 
as  RED  relies  on  dropping  packets  actively  to  control  the  TCP  flow  rate.  This 
method  wastes  bandwidth  resources  and  caused  unnecessary  re-transmissions.  For 
non-real  time  application  such  as  transmitting  a  large  data  hie  by  FTP,  packet 
delay  is  not  an  important  issue.  Under  this  structure,  we  have  the  freedom  to 
control  the  how  rate  of  non-real  time  application  by  increasing  the  queuing  delay 
instead  of  dropping  the  packet. 
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Appendix  A 


Appendix 


A. 0.1 


Proposition^. 3. 3 


lim  IM=E\V]B. 

i^oo  Y(t) 

Proof:  Since  there  are  N  independent  user  connections,  we  let  N  =  1  without  loss 
of  generality. 


lim  m 

t^OO  Y(t) 


lim 

t— XX 


B  lim 

t— XX 


B  lim 

t— XX 


B  /q  Uk(u)Vk(u)du 
fo  Uk(u)du 

E'=i  fti—i  Uk(u)Vk(u)du  +  ftm  Uk(u)Vk(u)du 
Y7=1  Itii  uk(u)du  +  ftm  Uk{u)du 
D=i  tiL  yk(u)du  +  //m  Vk{u)du 

Ya= i(^  -^-l)  +  £  -tm 


B  lim  /» 

t'—>oo 


f 


=  B- 


ET, 


21 


ET‘2\  +  ET2o 
=  BE[V] 


(A.l) 
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where  U-\  and  t,t  are  the  beginning  and  end  points  of  the  ON  state,  tm  is  the 
maximal  end  point  which  is  less  than  t,  and  V'(t)  is  a  shifted  version  ofV{t). 


A. 0.2 

Proposition  3.3.6: 

Without  loss  of  generality,  let  A  =  1  and  a2  =  EG2  =  1.  From  (2.32), 
Ra(k)  =  EG0EGk 

=  E{BH(l)~BH(0))(BH(K  +  l)-BH(k)) 

=  Cov(BH(l)BH(k  +  1))  -  Cov(BH(l)BH(k)) 

=  ^(\k  +  l\2H -2\k\2H +  \k-l\2H) 

and 

lim  Rc(k ) 

k — >oo 

=  lim  l(\k  +  l\2H  -  2\k\2H  +  \k  -  l\2H) 

=  lim  -k2H~2[k2(\l  +  r\2H  -  2  +  |1  -  h2H)} 
k — >oo  2  k  k 

Note  that 

,  (1  +  x)2H  —  2  +  (1  —  x)2H 

lim - 5 - 

x — >0 

2H(1  +  x)2H~l  -  2H(1  -  x)211-1 
=  lim - 

z— >o  2x 

2H{2H  -  1)(1  +  x)2H~2  +  2H{2H  -  1)(1  -  x)2H~ 2 
x^o  2 

=  2H(2H  —  1) 


(A.2) 
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A. 0.3 


Show  equations  (4.54)  and  (4.55)  with  a  Lognormal  distribution  /^(x): 


and 


where 


„  .  ,  1  r  (lnx  —  u)2 

Ja(x):= - 7^=  exp[— - — ],  x  >  0 
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where 


x2f(x)dx  = 
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